Slashdot Mirror


Why Linux Makes Sense for India

"The localisation of Linux to Indian languages can spark off a revolution that reaches down to the grassroots levels of the country," writes Prof. Venkatesh Hariharan. Read the rest of his informative essay Below .

Why Linux Makes Sense for India

Falling costs have made computers more affordable to a larger section of India's population. At the same time, the Internet has made the PC a compelling proposition for fulfilling communications, education, entertainment and information needs. Based on these two trends, the market for Information and Communication Technologies (ICT) is likely to take off significantly in India.

Yet, India faces a peculiar problem in that almost all popular operating systems and applications packages are available only in English, a language which is spoken by a mere ten percent of the population. The lack of "Indianized" software is therefore an issue that seriously hampers the growth of the Indian computer industry. For almost 915 million Indians, the lack of Indian language interfaces is one among many issues that hamper their ability to reap the benefits of information technology. This is creating a new class of people who live in what can be called as "Information Poverty" even as technology becomes cheaper and cheaper.

At the infrastructure level, the barriers to information access are dropping dramatically with new ISPs coming into India and several players jockeying to provide bandwidth and other back-end services. However, without operating systems, applications and Internet content in Indian languages, key benefits of the digital revolution-e-commerce, low cost communication through e-mail, access to information databases, telemedicine services etc are denied to the Indian masses. Giving Internet access to an Indian who does not know a shred of English is like giving someone the keys to a car when there are no roads to drive on!

One development that can help India out of this deadlock is a national-level, collaborative effort to localise Linux to Indian languages.

Linux is a free operating system that has gained phenomenal popularity in recent times because it allows users to modify it to suit their own needs. Linux is a collaborative effort of thousands of programmers interacting over the Internet and is therefore not owned or controlled by any one company. In this article, we outline the economic and cultural imperatives for the localisation of Linux.

Free operating systems have several advantages for developing countries because most software packages today are developed in the west and then sold in developing countries where the parameters of affordability are completely different. The Bangladeshi activist Shahidul Alam expresses these differences poetically when he says, "A modem costs more than a cow." The benefits of free software multiply exponentially when we look at large-scale implementations. The Government of Mexico is estimated to have saved close to $125 million that would otherwise have been spent on proprietary systems when it signed up Red Hat to implement Linux in more than 140,000 schools and colleges across Mexico. In India too, large operators like World-Tel (which plans to have a thousand Internet Centres in Tamil Nadu, with each of them having between two to 20 PCs each) have expressed their intention to go the free software way. The company is negotiating similar deals with several other state governments. Organizations like World-Tel, Internet centres, schools and homes etc. can be expected to be significant users of Indian language operating systems.

The growth of content in platform-independent file formats (HTML, MP3 etc) has also reduced the dependence on a specific operating system, making Linux a viable option.

Apart from these, there are cultural reasons that make Linux attractive. The existing user interface paradigm of files and folders evolved because computers were essentially designed for a western audience familiar with real-life files and folders. There is no reason to assume why the same paradigm should apply to a trader in Tamil Nadu or a farmer in Madhya Pradesh.

The openness of Linux (and other free operating systems like Free BSD) allows local linguistic groups to customise user interfaces in ways that are far more culturally sensitive than any centrally controlled approach. Linguistic groups that may be considered too small a market by vendors can also take their destiny in their own hands by customising the Linux interface to their own needs.

It is therefore clear that Linux is a very attractive long-term solution to India's computing needs.

Localising the user interface of Linux to all the 18 official Indian languages will involve changing the menus and help-text to Indian languages and creating a whole stack of applications and tools (word processors, browsers, spell-checkers etc.) to enable computing in Indian languages.

This is a task that involves both technical and linguistic challenges. For example, should "File" simple be called "File" but written in Indian scripts because it is now a part of popular usage? Or should we find Indian language equivalents? In some cases it makes little sense. For example, how many people know that the Hindi word for computer is "sanghanak"? Or what is the Hindi equivalent for "Internet"? A very sensitive balance has to be struck between practicality and preserving Indian languages. However, Indian linguistic groups will have to wake up to the fact that their languages will become outdated if they do not become a part of the digital age. In fact, the Internet can be one of the finest means of recording, archiving and propagating Indian culture. Since culture is embedded in language to a significant degree, the ability to compute in one's native language can give Indian culture a significant boost.

However, one of the greatest roadblocks to computing in Indian languages has been the lack of widely accepted standards. If millions of people are able to freely e-mail each other, it is because of a widely accepted standard called ASCII (American Standard Code for Information Interchange). It is sad that in spite of claims that India is a software superpower, we cannot harness IT for the benefit our own nation's citizens and the greatest stumbling block is a lack of agreement on standards. Check out ten different Hindi newspapers on the Web to see for yourself. You'll end up downloading and installing ten different fonts that (in most cases) can be used for browsing that one site and nothing else. It is because of this reason that Hindi, despite being one of the largest spoken languages in the world, has a negligible presence on the Web. Informed sources feel that the Unicode standard (which Microsoft has adopted for the upcoming Windows 2000 operating system) will soon become the de-facto standard settling the language standards issue once and for all. If this prediction comes to pass, it will significantly increase the domestic market for hardware, software and services, which is restricted only to a small fraction of India's population that understands English.

There are several initiatives that are underway in order to make this possible. The National Centre for Software Technology has submitted a proposal to the Technology Development in Indian Languages of the Government of India. TheIndian Institute of Technology, Madras has already started work on localising Linux to Malayalam and Tamil. My own institute, the Indian Institute of Information Technology, Bangalore has committed resources to this the "IndLinux" project and started a collaborative effort to realise this goal. IndLinux has attracted the interest of organizations like FreeOS.com and many individuals located around the world.

In conclusion, it has to be said that the Indianisation of Linux is probably one of the most practical ways of making information technology available to millions and millions of Indians. It is now upto linguistic and technical groups to collaborate and make things happen.

-0-

Prof. Venkatesh Hariharan is with the Indian Institute of Information Technology, Bangalore. He can be reached at venky@iiitb.ac.in.

3 of 336 comments (clear)

  1. And this is the way the World is won... by FFFish · · Score: 5
    ...person by person, country by country -- and starting with the corners of the world that are ignored: where there are barriers to language, barriers to affordability, barriers to access, barriers to the type of application that is needed.

    China (slashdot story), Mexico (slashdot story), India... those are significant populations. It's a step in the right direction.

    Remove language barriers: internationalize all parts of Linux.

    Remove affordability barriers: release up-to-date packages that are designed to be useable on old 386 systems. In a lot of countries, Pentium are unaffordium.

    Remove barriers to access: donate your old hardware to third-world countries. Help get Linux distributed -- donate a diskette-set to an emerging-world school.

    Remove application barriers: internationalize applications. Identify what old software (running on DOS, Commodore, other old iron) should be ported to Linux.

    Support the world outside of the little space you inhabit. Think outside the USA, think outside white Europe. Most of the world isn't like you. Look after them, and it'll pay back a hundredfold...

    --

    --

    --
    Don't like it? Respond with words, not karma.
    1. Re:And this is the way the World is won... by Menthos · · Score: 5
      I don't think that everything about an application should be internationalized. It's the UI of the application, and the documentation, that needs translation.

      Internationalization of comments? This would indeed be useful to the non-English world, but I don't think that it's a high-priority thing.
      Many source code comments are not even understandeable by other people than who wrote them, and since the source may change very often (but not necessarily the UI) it would be hard to keep up. This goes also for function and variable names. Translation of these could also very easily break things.
      Not saying that there isn't translation of programming languages out there - an example that comes to mind is the macro language (VBA?) in MS Office applications that is translated (function calls etc.) in localized versions. However, I don't understand the point in this - most programmers or other computer-savvy people in the world know English well enough to be able to read the name (!) of a function call in the documentation and use it. Again, it's the documentation that should be translated, not the function calls, variable names, etc, where translation could break things.

      I'm translating programs myself (GNU utilities into Swedish) and there's a reason why we (translators) have a strict policy not to translate such things as command-line switches. This would break things. So "--verbose" should be left intact in the translated strings, no matter if it is Hindi, Swahili, Esperanto, Dutch or Swedish. But the explaination of "--verbose" in "--help", and the documentation, is translated.

      To sum things up: Since many wonderful programs lack proper UI translation in many languages (not to mention the documentation!), I think this is where the main efforts should be concentrated.
      Also, internationalization is more than just translation. For example, I'd love to have more applications understand localization settings and properly react to them - I hate when an application defaults to inches as a measure, weeks beginning on Sundays, Legal as the paper format, AM/PM clock, "," as thousand delimiter, etc, etc. All things that are NOT the "standard" outside the USA, and things that many programmers overlook. Even worse is when these annoying things are hard-coded - it goes from being merely annoying to a chronic pain, and often makes me refuse to use the program at all.

      So even if programs are translated, there's still more left to do to make it internationalized.

      --

      GNU/Linux. The Freshmaker.

    2. Re:And this is the way the World is won... by ianezz · · Score: 5

      > This brings up a good point: do you internationalize the source code?

      Just my 0.02 Euro here.

      I'm Italian, and I live in Italy. I'm just a C/C++ programmer like many others. The source code I write has all the comments and symbols names in English. This mainly for 2 reasons:

      1) Technical language: programming needs a whole new class of technical terms. Simple words like `formatting', `font', etc. didn't have equivalents in Italian when they were introduced to the mass in the former '80. Simply, we adopted (and distorted) the usual English words. Attempts to create a brand new tech gerg just failed. After 20 years the mass starts using the Italian form for "directory", but still it's a minorty. Just figure yourself when it comes to source code...

      2) Technical documentation: good technical documentation is written in English. Programmers like me avoid translations, just because they make life more complex (the effort is doubled: first you read it, then you try to figure out how the terms used are related to the well known English terms). I'm translating the GNU Emacs manual in Italian in my spare time, and it's a damn difficult work (and it's not at all about programming - figure it yourself when it comes to programming books).

      3) I18n: English is good for symbol names, just because you don't have to do with accents. Italian words just look ugly without accents, and using ISO-8859-1 is still not a viable option for sources... :-)

      4) Last but not least: English is the de facto `lingua franca' for programmers, just like Latin was for the europeans in the passed ages. If you start writing your sources using one of the 18 languages of India for comments and symbol names, probabilities are that only indians will ever peek into them, just taking out the rest of the world. And since most people giving help and advice on programming out there speak English well enough to make one understand them (perhaps except Alan Cox, sometimes ;-), probabilities are that one's best choice is (for now) learn English.

      Of course, I just use Italian in every other context, just like everyone else...

      I hope this is a bit clarifying.