Re: Localized Pages
- From: Christian Rose <menthos menthos com>
- To: gnome-web-list gnome org
- Subject: Re: Localized Pages
- Date: Tue, 28 Nov 2000 18:52:55 +0100
Joakim Ziegler wrote:
Could you try automatic language detection with SourceForge
(http://www.sourceforge.net)? SourceForge uses language preference
detection based on PHP, roughly the same solution that was discussed
here. And yes, SourceForge does this by default.
Sourceforge is in English to me, no matter what I do with IE's language
settings.
I have had no problem with this in Mozilla and Netscape. I suspect your
problem in IE is because SourceForge expects a language code (like
"ll"), while what IE sends might be a full locale (like "ll-CC" where
"ll" is language code and "CC" is country code).
However, this is trivial to fix. For most languages, we don't need a
particular country code (since we don't have specific translation for
that country, just for languages), so it's just a matter of stripping
the last country code part and only use language code. Problem solved.
Not a very advanced programming hack, and it has been done before.
What is more important is that you *did* after all get English as the
fallback, which is what you wanted for fallback. However, if the
language detection had worked for you, you would have gotten a
translated page, no fuss, no mess. If your English knowledge would be
very bad or you didn't understand English at all you would have been
able to read the page, which I think is a big plus compared to not being
able to understand the page at all.
I and obviously others have no problems with this, it works for us, and
as said above, support for IE would be *very* trivial to add.
At this point, it seems like our language detection algorithm is getting
complex, but it's really not. Also, it only needs to be done once, since we
should use a cookie to set the preference.
Actually, I think setting a cookie is only necessary if the user
switched language manually (then we know that he is not satisfied with
the detected language). Only then, we must explicitly remember his
choice. In all other cases the language detection obviously worked and
there is no need to set a cookie.
I still think that it should respect the language setting in the browser
by default. If it is ok for SourceForge, then why not for the GNOME web
site? I think you should try SourceForges language detection and see if
it works for you. If it does, your problem was with the debian.org
implementation of language detection (debian.org uses Apache for
language detection I think), and I don't see a good technical reason at
all not to use language detection by default on the gnome pages to
server content in the detected language.
It doesn't work.
Yes it does. You have only provided *one* example where the results were
"fatal" (you got Polish when you wanted English on the debian.org page).
This was with IE (if I remember correctly), and more importantly,
debian.org uses Apache for determing language. This is probably not at
all what we will do (it seems we will use PHP, and hence we can use our
own code for this) and I don't think that it is relevant since
SourceForge, which uses a PHP solution, doesn't give you a Polish page.
So that problem was specific to the Debian solution, and hardly relevant
to this.
I think picking *one* case like this where it didn't work, a problem
with a specific browser and a specific language and where the problem
probably won't affect our solution to this, and on the same time
disregard that not respecting language settings generally will result in
a lot of people around the world, regardless of browser, won't be able
to understand the site, is a very strange way to prioritize things.
However, it fails for me in a different way than Debian.org
(I always get English).
See above.
It seems to me that language detection is error-prone
and badly supported.
Not at all. It is not more error-prone than we make it. We can make the
language detection exactly like we want it, and as good as we want it.
On the other hand, that is no good if we don't use it, which is what
this discussion is about.
On the other hand, people *expect* to get sites in
English when they go to a site with a generic TLD, such as .org, unless the
site is something specific to a certain country or language.
I disagree. The only thing that would make a user expect English when he
goes to microsoft.com is that he has been there before and it was in
English at that time. People, in my experience, go to .com, .net and
.org domains because they suspect these are the official sites, and they
do want to go to the official site, but not because of a particular
language preference.
For example, .nu is very popular, it is the country code for the island
of Niuawi (sp?) but that doesn't make the content on the majority of .nu
domains in the language of Niuawi, or people expecting it that way.
.com, .org or .net and other TLDs aren't bound to US, and there is no
rule that says that content on those TLDs should be in English. They
often are, but not always. Assuming that those common TLDs should be
reserved for English is very broken.
A lot of countries have *very* strict domain name rules for the country
TLD, so that it is in fact impossible to register the domain that you
want in that TLD if you're not a company registred in that country and
can prove that you have a company name or trade mark like that domain.
Sweden has such rules for example, that is why we for example can't
register gnome.se, because we can't afford to start a company in Sweden
with that name just to get that domain. As you would assume, .com and
.net are popular domain TLDs for Swedish sites, and sites just targeted
to Swedish visitors.
I know this is true in other countries too, it has happened more than
one time that I've gone to a .com just to find a site in Japanese,
targeted just to japanese visitors.
Another reason .com is popular is because this is what all browsers will
default to. If you just enter "something", it will try to go to
"www.something.com". Of course, this is attractive to people around the
world when setting up a site, no matter the language, but this assumes
that you have registered something.com.
The result is that there are *a lot* of .com-like domains out there that
are targeted to other audiences than those speaking English, and they
get more and more all the time. The reason why you might experience that
.com, .net and .org domains are targeted to English-speaking people is
that it is only advertisements for those sites you will see if you live
in an English-speaking country. As a person who doesn't live in such a
country, I can tell you that that assumption is not at all clear here -
you have a lot of advertisement and targeting for local sites using
.com, .net, .org, and .nu, all sites targeted to the local language. So
for an international visitor, he might not at all expect an English site
when he hears about GNOME and goes to www.gnome.org to find out about it.
Automatic language detection is not used by any large, successful commercial sites,
Why does it have to be commercial to count as an example? SourceForge is
a large web site, but it isn't commercial, and localization obviously
works for them.
such as Yahoo!, Apple, Microsoft, etc., they all rely on subsites (or sites
under the relevant TLDs, which we can also do if someone wants to pay the
registration), and that doesn't seem to scare away any users.
The problem is that in some countries you can't get a domain in that TLD
even if you pay an enormous amount of money. You'll have to have a
company with that name registered in that country too before they will
even look at your domain name application. Microsoft, Yahoo!, Apple,
Adobe etc can afford setting up local companies, but I doubt the GNOME
project can.
And microsoft.CC (CC=country code) still isn't intuitative to most
people, that's why they do an enormous amount of advertising on
microsoft.com to attract people to the local sites. A construct like
www.se.gnome.org is even less intuitative to someone just trying to
guess where the GNOME site is. That is why localization of the main site
is so important.
On the other hand, presenting a page in Polish or whatever when people expect
English *is* a potentially fatal error, because people aren't used to Polish
being the standard language, and as such are much more prone to thinking "Oh,
this is a Polish project, it has nothing to do with me", and leave forever.
And how should this be more common than a person coming to gnome.org to
learn about GNOME, thinking "oh, this is only in English" and leave,
just because he didn't realize that GNOME isn't at all reserved to just
English, and that GNOME *is* availiable in his language by default, but
the web page isn't?
You can reverse the scenario like this over an over; an English-speaking
user getting the wrong page because of a bug and thus loose interest in
using GNOME, or an international visitor getting a page in English, a
language that he doesn't understand well or not understand at all, and
thus loose interest in using GNOME, but I think the latter would be
*much* more common than the former scenario.
The former is caused by a rare bug that only affects some of the
visitors and that is fixable, the latter is caused by a bad site design
for international visitors and will affect all international visitors.
Christian
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]