Re: wgo url policy



On Fri, 2006-07-14 at 00:24 +1000, Jeff Waugh wrote:
<quote who="Dave Neary">

Minimal redundancy
Avoid multiple URLs for a given page. Currently both http://gnome.org
and http://www.gnome.org exist. I suggest to forward all gnome.org
access to www.gnome.org. Are there any duplicate pages on wgo now?

These are not duplicate pages, they are aliases. gnome.org is an alias
for www.gnome.org.

That's not what Gergely was pointing out, though the proximity of the two
points makes it easy to misunderstand his intent. First up, he's suggesting
that gnome.org should simply redirect to www.gnome.org, such that all Google
juice and so on points there, and we be very consistent in our use of the
domain name (a very good point - and now fixed, by the way). Additionally,
he was asking if there were duplicate pages (as opposed to domains) on the
site.

Oops, yeah, 2 points in 1 mix :)
First, domain aliases are imho bad, because they introduce
inconsistency. Reasons:
- proxies and browser caches will not realize it's the same thing
- same with indexing engines (Google juice as Jeff points out)
- confusing for (novice?) users
- inconsistent links in documentation, about boxes, etc
- history, visited links in the browser will not work properly 

(btw i noticed gnomejournal.org has this aliasing too)

As a second issue I meant possible duplicate information within wgo.
Perhaps this better belongs to the "content mapping" task.

- I suggest no extensions to file names, this way resources can act as
containers
for example:
www.gnome.org/start and www.gnome.org/start/2.14
instead of
www.gnome.org/start.html and www.gnome.org/start/2.14 

I don't think this is really an issue. Direct linking to a html page is
common, having one directory per page (which is what would result) is
unwieldy, and (as you point out) .odf, .pdf, .jpg, etc - files have
extensions, those extensions convey some information. I don't think we
need a policy for this.

I do think this is an issue, and one that I have pushed as much as possible
on wgo as it stands today. It is significantly clearer and more future proof
to have a directory with an appropriately named index file in it than to be
referencing specific files all the time. I 100% support the *continuation*
of this policy. :-)

This important, since one of our goals is to increase consistency and
interoperability between individual gnome sites. Another goal is to be
CMS independent as much as possible, so we can possibly swap the
underlying infrastructure if a better comes along, without breaking any
links. Properly thought-after URL naming can abstract away
implementation details. I applaud the current wgo scheme :)

I could not yet decide what I think is the best URL scheme for I18N.

I think the best is no URL scheme for languages and use the browser
preferences.

If you want to let people change the displayed language if the negotiated
one is incorrect or not desireable, you have to select some kind of URL
scheme. I don't think this needs to be fully defined ahead of time - the
scope for this will be defined by the technology we choose, though we can
certainly set some parameters. For instance: we can't use subdomains for
language codes, because some subdomains are already in use by local GNOME
groups; we probably shouldn't be screwing up our URLs by sticking language
codes in there; perhaps the query string approach is the best (can you use a
query string to override Apache content negotation already?).


I think the URL should be independent of the underlying technology, see
my point above. I just hope we can pull it off :) Good point on not
using subdomains, they can be reserved for local content (for local
groups and their local content, not the same general content translated
into a bunch of languages).

Like Jeff, I also tend to think that query string approach is the
cleanest. It implies it's the same resource with a different parameter,
that is "display language". Can be problematic though with some CMS
engines and usual URL rewriting rules (that for example would
translate /foo/bar/baz?lang=hu into /index.php?page=foo/bar/baz?lang=hu
which is obviously wrong: ? appears twice).

I guess we all agree, that a page in a specific language needs it's own
URL?



I realize I'm quite anal about getting the URL scheme right :) But I
think URLs are a very important and so far neglected aspect of proper
web presence (I don't mean the gnome web, but in general). For more info
see:
http://live.gnome.org/GnomeWeb/PlatformIssues
http://www.w3.org/Provider/Style/URI
http://www.w3.org/TR/webarch/
http://www.gbiv.com/protocols/uri/rfc/rfc3986.html
or even
http://www.ics.uci.edu/%7Efielding/pubs/dissertation/top.htm

cheers,
Greg




[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]