Discussion:
I-D Action: draft-ietf-iri-bidi-guidelines-02.txt
i***@ietf.org
2012-03-09 18:09:07 UTC
Permalink
A New Internet-Draft is available from the on-line Internet-Drafts directories. This draft is a work item of the Internationalized Resource Identifiers Working Group of the IETF.

Title : Guidelines for Internationalized Resource Identifiers with Bi- directional Characters (Bidi IRIs)
Author(s) : Martin Duerst
Larry Masinter
Adil Allawi
Filename : draft-ietf-iri-bidi-guidelines-02.txt
Pages : 10
Date : 2012-03-09

This specification gives guidelines for selection, use, and
presentation of International Resource Identifiers (IRIs) which
include characters with inherent right-to-left (rtl) writing
direction.


A URL for this Internet-Draft is:
http://www.ietf.org/internet-drafts/draft-ietf-iri-bidi-guidelines-02.txt

Internet-Drafts are also available by anonymous FTP at:
ftp://ftp.ietf.org/internet-drafts/

This Internet-Draft can be retrieved at:
ftp://ftp.ietf.org/internet-drafts/draft-ietf-iri-bidi-guidelines-02.txt
Pete Resnick
2012-04-08 17:12:39 UTC
Permalink
X-IronPort-AV: E=McAfee;i="5400,1158,6674"; a="179773738"
Received: from ironmsg04-l.qualcomm.com ([172.30.48.19])
by wolverine01.qualcomm.com with ESMTP; 08 Apr 2012 10:12:42 -0700
X-IronPort-AV: E=Sophos;i="4.75,389,1330934400";
d="scan'208";a="192833338"
Received: from nasanexhc07.na.qualcomm.com ([172.30.39.190])
by Ironmsg04-L.qualcomm.com with ESMTP/TLS/AES128-SHA; 08 Apr 2012 10:12:42 -0700
Received: from resnick2.qualcomm.com (172.30.39.5) by qcmail1.qualcomm.com
(172.30.39.190) with Microsoft SMTP Server (TLS) id 14.2.283.3; Sun, 8 Apr
2012 10:12:41 -0700
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.1.9) Gecko/20100630 Eudora/3.0.4
In-Reply-To: <***@ietfa.amsl.com>
X-Originating-IP: [172.30.39.5]
Received-SPF: pass client-ip=199.106.114.254; envelope-from=***@qualcomm.com; helo=wolverine01.qualcomm.com
X-W3C-Hub-Spam-Status: No, score=-4.3
X-W3C-Hub-Spam-Report: BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01
X-W3C-Scan-Sig: maggie.w3.org 1SGvfk-0005oL-EZ cf3bbd8475d265099c5adc397244ec42
X-Original-To: public-***@w3.org
Archived-At: <http://www.w3.org/mid/***@qualcomm.com>
Resent-From: public-***@w3.org
X-Mailing-List: <public-***@w3.org> archive/latest/2085
X-Loop: public-***@w3.org
Resent-Sender: public-iri-***@w3.org
Precedence: list
List-Id: <public-iri.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:public-***@w3.org>
List-Unsubscribe: <mailto:public-iri-***@w3.org?subject=unsubscribe>
Resent-Message-Id: <E1SGvfo-0007Ok-***@frink.w3.org>
Resent-Date: Sun, 08 Apr 2012 17:13:08 +0000
Archived-At: <http://permalink.gmane.org/gmane.org.w3c.public-iri/1892>

Greetings,

I'm just starting to review things as your new AD. I noticed that the
header of this document says that its intended status is "Best Current
Practice". I believe that is incorrect. It should be "Proposed
Standard". Please change this, or give a very compelling reason that
this is not on the standards track. If it is not on the standards track,
I will argue vociferously for it being "Informational". It is certainly
not appropriate for BCP, which is (many exceptions notwithstanding) for
IETF process and procedures.

I think standards track is appropriate; it gives protocol rules for the
use of Bidi in IRIs.

pr
Post by i***@ietf.org
A New Internet-Draft is available from the on-line Internet-Drafts directories. This draft is a work item of the Internationalized Resource Identifiers Working Group of the IETF.
Title : Guidelines for Internationalized Resource Identifiers with Bi- directional Characters (Bidi IRIs)
Author(s) : Martin Duerst
Larry Masinter
Adil Allawi
Filename : draft-ietf-iri-bidi-guidelines-02.txt
Pages : 10
Date : 2012-03-09
This specification gives guidelines for selection, use, and
presentation of International Resource Identifiers (IRIs) which
include characters with inherent right-to-left (rtl) writing
direction.
http://www.ietf.org/internet-drafts/draft-ietf-iri-bidi-guidelines-02.txt
ftp://ftp.ietf.org/internet-drafts/
ftp://ftp.ietf.org/internet-drafts/draft-ietf-iri-bidi-guidelines-02.txt
--
Pete Resnick<http://www.qualcomm.com/~presnick/>
Qualcomm Incorporated - Direct phone: (858)651-4478, Fax: (858)651-1102
John C Klensin
2012-04-08 17:20:49 UTC
Permalink
--On Sunday, April 08, 2012 12:12 -0500 Pete Resnick
Post by Pete Resnick
Greetings,
I'm just starting to review things as your new AD. I noticed
that the header of this document says that its intended status
is "Best Current Practice". I believe that is incorrect. It
should be "Proposed Standard". Please change this, or give a
very compelling reason that this is not on the standards
track. If it is not on the standards track, I will argue
vociferously for it being "Informational". It is certainly not
appropriate for BCP, which is (many exceptions
notwithstanding) for IETF process and procedures.
Pete, while I'm a strong believer in Applicability Statement
handling for documents like this one, I hope the BCP will
continue to remain available for descriptive documents that
really do describe established and tested "best" practices, not
just IETF process and procedure documents (which, IMO, should
eventually really be in a category by themselves).
Post by Pete Resnick
I think standards track is appropriate; it gives protocol
rules for the use of Bidi in IRIs.
Agreed.
john
Pete Resnick
2012-04-08 21:17:08 UTC
Permalink
X-IronPort-AV: E=McAfee;i="5400,1158,6674"; a="177492997"
Received: from ironmsg04-l.qualcomm.com ([172.30.48.19])
by wolverine02.qualcomm.com with ESMTP; 08 Apr 2012 14:17:13 -0700
X-IronPort-AV: E=Sophos;i="4.75,389,1330934400";
d="scan'208";a="192922683"
Received: from nasanexhc04.na.qualcomm.com ([172.30.48.17])
by Ironmsg04-L.qualcomm.com with ESMTP/TLS/AES128-SHA; 08 Apr 2012 14:17:13 -0700
Received: from resnick2.qualcomm.com (172.30.48.1) by qcmail1.qualcomm.com
(172.30.48.17) with Microsoft SMTP Server (TLS) id 14.2.283.3; Sun, 8 Apr
2012 14:17:12 -0700
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.1.9) Gecko/20100630 Eudora/3.0.4
In-Reply-To: <***@PST.JCK.COM>
X-Originating-IP: [172.30.48.1]
Received-SPF: pass client-ip=199.106.114.251; envelope-from=***@qualcomm.com; helo=wolverine02.qualcomm.com
X-W3C-Hub-Spam-Status: No, score=-4.3
X-W3C-Hub-Spam-Report: BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01
X-W3C-Scan-Sig: lisa.w3.org 1SGzUQ-0006bQ-BZ 21a2ade3e7ffd89140dc2f05a0f1f6c9
X-Original-To: public-***@w3.org
Archived-At: <http://www.w3.org/mid/***@qualcomm.com>
Resent-From: public-***@w3.org
X-Mailing-List: <public-***@w3.org> archive/latest/2087
X-Loop: public-***@w3.org
Resent-Sender: public-iri-***@w3.org
Precedence: list
List-Id: <public-iri.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:public-***@w3.org>
List-Unsubscribe: <mailto:public-iri-***@w3.org?subject=unsubscribe>
Resent-Message-Id: <E1SGzUU-0002TW-***@frink.w3.org>
Resent-Date: Sun, 08 Apr 2012 21:17:42 +0000
Archived-At: <http://permalink.gmane.org/gmane.org.w3c.public-iri/1894>
Post by John C Klensin
--On Sunday, April 08, 2012 12:12 -0500 Pete Resnick
[...]It is certainly not
appropriate for BCP, which is (many exceptions
notwithstanding) for IETF process and procedures.
Pete, while I'm a strong believer in Applicability Statement
handling for documents like this one, I hope the BCP will
continue to remain available for descriptive documents that
really do describe established and tested "best" practices, not
just IETF process and procedure documents (which, IMO, should
eventually really be in a category by themselves).
This is obviously not the place for the more general conversation, but
in summary: I have seen scant few documents produced in this
organization (at least since the disappearance of the User Services
area) that described established and tested "best" practices *and* were
not a proposal for a standard way of doing things over the Internet, one
that could be improved with incremental deployment and implementation
experience and could eventually become a protocol standard. There are
examples of local configuration descriptions and the like that could
reasonably fit into this category, but they are relatively uncommon in
the IETF.

In any event, with regard to this document...
Post by John C Klensin
I think standards track is appropriate; it gives protocol
rules for the use of Bidi in IRIs.
Agreed.
...it appears that we agree that this isn't one of them.

pr
--
Pete Resnick<http://www.qualcomm.com/~presnick/>
Qualcomm Incorporated - Direct phone: (858)651-4478, Fax: (858)651-1102
Larry Masinter
2012-04-12 07:27:27 UTC
Permalink
i agree, my changing the status was a mistake.

That said, I think it would help a lot to identify the roles for which conformance behavior is being standardized. I suspected that sometimes we were talking about IRI selection, sometimes about IRI processing, sometimes about Iri display, and rarely about restrictions on IRI syntax.


-----Original message-----
From: Pete Resnick <***@qualcomm.com>
To: John C Klensin <john-***@jck.com>
Cc: "public-***@w3.org" <public-***@w3.org>
Sent: Sun, Apr 8, 2012 21:18:33 GMT+00:00
Subject: Re: I-D Action: draft-ietf-iri-bidi-guidelines-02.txt
Post by John C Klensin
--On Sunday, April 08, 2012 12:12 -0500 Pete Resnick
[...]It is certainly not
appropriate for BCP, which is (many exceptions
notwithstanding) for IETF process and procedures.
Pete, while I'm a strong believer in Applicability Statement
handling for documents like this one, I hope the BCP will
continue to remain available for descriptive documents that
really do describe established and tested "best" practices, not
just IETF process and procedure documents (which, IMO, should
eventually really be in a category by themselves).
This is obviously not the place for the more general conversation, but
in summary: I have seen scant few documents produced in this
organization (at least since the disappearance of the User Services
area) that described established and tested "best" practices *and* were
not a proposal for a standard way of doing things over the Internet, one
that could be improved with incremental deployment and implementation
experience and could eventually become a protocol standard. There are
examples of local configuration descriptions and the like that could
reasonably fit into this category, but they are relatively uncommon in
the IETF.

In any event, with regard to this document...
Post by John C Klensin
I think standards track is appropriate; it gives protocol
rules for the use of Bidi in IRIs.
Agreed.
...it appears that we agree that this isn't one of them.

pr
--
Pete Resnick<http://www.qualcomm.com/~presnick/>
Qualcomm Incorporated - Direct phone: (858)651-4478, Fax: (858)651-1102
John C Klensin
2012-04-12 13:46:40 UTC
Permalink
--On Thursday, April 12, 2012 00:27 -0700 Larry Masinter
Post by Larry Masinter
...
That said, I think it would help a lot to identify the roles
for which conformance behavior is being standardized. I
suspected that sometimes we were talking about IRI selection,
sometimes about IRI processing, sometimes about Iri display,
and rarely about restrictions on IRI syntax.
And, fwiw, I think that is another symptom of lack of broad
consensus about whether IRIs are part of UI/presentation
interfaces or are protocol elements. Proceeding simultaneously
as if they are both, or whichever is convenient at the moment,
does not make it easy to think clearly about those role, much
less identify them.

That is also where the larger issue that I've been asked to
defer intrudes. To illustrate briefly, I note that the IETF has
never been foolish or arrogant enough to try to tell UI
designers what they can or cannot do. Now suppose that I'm a
designer of a UI for a language that is very different from
Western European ones and that uses a script that is very
different from the Greek-Latin-Cyrillic group. I conclude
that, for my users, IRIs are a completely unreasonable
user-presentation and user-input form, whether because of issues
specific to IRIs or because I've concluded that long-tailed,
multiple-element URIs are not an acceptable user-presentation
and user-input form completely independent of i18n issues. It
doesn't make much difference to this hypothetical case whether
my issues are relatively small (e.g., presentation of method
names, order of labels in domain names) or more complex (e.g.,
moving to a completely standardized and explicit tagged metadata
format).

The question then is whether I layer my presentation format on
top (and translate to) IRIs or layer it directly on top of URIs.
If the answer is the second, then we better not do anything with
IRIs (or with HTML, XML, HTTP, etc.) that requires the first.
And, if it is the first, then we need to be really, really,
clear about canonical forms as well as about why such an
interface should have to go through two translation/mapping
steps at least most of the time.

john
Larry Masinter
2012-04-13 14:42:51 UTC
Permalink
And, fwiw, I think that is another symptom of lack of broad consensus about whether IRIs are part of UI/presentation interfaces or are protocol elements.
I'm certain that the route we've gone down forces IRIs to be protocol elements in some circumstances, if only because IRIs (and not just URIs) appear within languages used in network protocols. I'm not sure I have heard anyone actually advocating for trying to confine IRIs merely to a presentational role, except as wishful thinking. I don't think you (John Klensin) are even advocating that position, are you?

I wish we could, but I don't think we can do so, because it's not how the web works.
Proceeding simultaneously
as if they are both, or whichever is convenient at the moment,
does not make it easy to think clearly about those role, much
less identify them.
Surely with IDNA we have some history of distinguishing between the roles of protocol element vs. presentational interfaces already, which should provide ample groundwork for thinking about them.

In fact, I have tried, in recent edits to the various drafts, to clarify that IRIs are protocol elements, and that there are also presentational forms ("printed on the side of the bus") which are not, themselves, IRIs, but presentations of them. If there is any part of any current document which you think isn't clear on this point, could you let us know so we can open a ticket and fix it?
That is also where the larger issue that I've been asked to defer intrudes. To illustrate briefly, I note that the IETF has never been foolish or arrogant enough to try to tell UI
designers what they can or cannot do.
I think there are numerous cases where IETF documents give advice for "user agents" (especially in "security considerations") as guidance for how a user interface might help avoid user confusion, although such advice is often couched in a "SHOULD". And I think most of the issues we're concerned (and for which we are giving advice) are ones where user confusion and associated security risks are the motivation for going into details in the IRI documents.
Now suppose that I'm a designer of a UI for a language that is very different from Western European ones and that uses a script that is very different from the Greek-Latin-Cyrillic group.
I think our obligations in IRI are quite similar to those in IDNA (and, in fact, some of the concerns expressed about IRIs are more appropriate for IDNA).
I conclude that, for my users, IRIs are a completely unreasonable user-presentation and user-input form, whether because of issues specific to IRIs or because I've concluded that long-tailed, multiple-element URIs are not an acceptable user-presentation and user-input form completely independent of i18n issues.
I think this is quite likely in some cases, and seems to be relevant even for ASCII: people use URL shorteners or expect search engines to provide a better user interface naming and discovery.
It doesn't make much difference to this hypothetical case whether my issues are relatively small (e.g., presentation of method names, order of labels in domain names) or more complex (e.g., moving to a completely standardized and explicit tagged metadata format).
I don't understand why we need to have hypothetical use cases when we have ample real ones.
The question then is whether I layer my presentation format on top (and translate to) IRIs or layer it directly on top of URIs.
It seems clear enough ot me that in many use cases, the protocols traffic in representations that include IRIs as protocol elements (HTML, XML-based languages), but in some use cases, the protocols send URIs, even when the presentation layer displays things that look like IRIs and encourage users to enter things that look like IRIs. An application designer often needs to deal with the URIs and IRIs protocol elements both and also the presentational forms of both, and the translations between those.
If the answer is the second, then we better not do anything with IRIs (or with HTML, XML, HTTP, etc.) that requires the first.
I don't think designing IRIs as a presentational form of URIs is feasible, because the IRI -> URI mapping is information lossy, and there are protocols, formats, languages that treat an IRI and the URI to which it might be converted as distinct. So "the answer" cannot be "layer directly on top of URIs", much as we wish it could.
And, if it is the first, then we need to be really, really, clear about canonical forms as well as about why such an interface should have to go through two translation/mapping
steps at least most of the time.

I'm all in favor of being clear, and would ask again if you could point out places in the documents that are not. I don't think it is feasible to handle our difficulties by being clear about "canonical forms", because different IRI applications have different needs for equivalence relationships, and often in a way that is not easily characterized by defining a canonical form.

As for te fact that interfaces which map between protocol and presentation might have to deal with both URI and IRI protocol elements -- well, they have to do that because the world is complex. I wish it weren't, but that's what we need to do, so let's get on with it.

Regards,

Larry
--
http://larry.masinter.net
Dave Thaler
2012-04-13 22:14:04 UTC
Permalink
And, fwiw, I think that is another symptom of lack of broad consensus about
whether IRIs are part of UI/presentation interfaces or are protocol elements.
I'm certain that the route we've gone down forces IRIs to be protocol
elements in some circumstances, if only because IRIs (and not just URIs)
appear within languages used in network protocols.
I would agree with that (i.e., we need to treat them as protocol elements).

[...]
In fact, I have tried, in recent edits to the various drafts, to clarify that IRIs are
protocol elements, and that there are also presentational forms ("printed on
the side of the bus") which are not, themselves, IRIs, but presentations of
them. If there is any part of any current document which you think isn't clear
on this point, could you let us know so we can open a ticket and fix it?
I agree with that direction.

But to be clear draft-ietf-iri-bidi-guidelines should be scoped to be not about
protocol elements at all, but about presentations of protocol elements, and
as such is higher layer than 3987(bis). There are probably places where the
text in draft-ietf-iri-bidi-guidelines needs to be improved to say so explicitly.

But it is basically trying to do what John calls "telling UI designers what they can
or cannot do". Similar issues undoubtedly occur with email addresses
(and other email-address-like identifiers), etc. not just IDNs as Larry mentioned.

-Dave
Bjoern Hoehrmann
2012-04-15 00:51:34 UTC
Permalink
Post by John C Klensin
And, fwiw, I think that is another symptom of lack of broad
consensus about whether IRIs are part of UI/presentation
interfaces or are protocol elements. Proceeding simultaneously
as if they are both, or whichever is convenient at the moment,
does not make it easy to think clearly about those role, much
less identify them.
I would find it very helpful if you could discuss that point without
using the term "protocol element". draft-duerst-iri-00 will be a de-
cade old on Tuesday and it says in the first sub-clause of the first
sentence of the Abstract that IRIs are protocol elements; RFC 3987
does the same. RFC 3987 might be using the term incorrectly, but in
the language of RFC 3987 IRIs are protocol elements by definition.
--
Björn Höhrmann · mailto:***@hoehrmann.de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/
Loading...