Kernerman Dictionary News • Number 12 • July 2004
Microsoft and Dictionary Publishers:
Charles M. Levine’s ‘The Coming Boom in English Lexicography: Some
Thoughts about the World Wide Web (Part One)’, appeared in Kernerman
Dictionary News, Number 9, July 2001 (http://kdictionaries.com/newsletter/kdn9-1.html).
Commitment to lexical data
Microsoft, as rightly identified by Esposito, “views lexical databases as an aspect of strategic technology”, and thus perceives such data in the broadest sense: the definitions and translation equivalents which are central to printed reference dictionaries are only part of the lexical data that Microsoft utilises. Information about spelling, pronunciation and grammar is perhaps even more important in many of Microsoft’s products. Whether in established businesses like Office or new areas such as Tablet PC or MS Reader, we make extensive use of wordlists – in look-up dictionaries, yes, but also in spellers, handwriting recognizers, and search and speech recognition engines, to give just some examples. Our requirements for lexical data will continue to develop, as we develop new products and add further localized languages to existing products. For Office 2003, for instance, we have added new localized versions for Catalan and Nynorsk.
In future, we should expect to see an increasing range of applications which make use of lexical data. Already, these include spellers in products such as Office, Works, Outlook Web, Access and Hotmail, as well as reference dictionaries (both monolingual and bilingual) which may be accessed in the Encarta Reference Library, online on MSN or through the new Research Pane in Office 2003.
Partnership in development
The parallel that has been drawn between the development of lexical data and Netscape is, I feel, misleading: it is technology – whether for Internet browsing or search – and not data itself which is central to Microsoft’s business. Developing our own lexical content (across more than 40 languages at that) is simply not a part of the company’s core mission. Why should we seek to develop so many dictionaries ourselves when excellent resources already exist, developed over many years by a range of dictionary publishers? Those resources, moreover, are already available, whereas any new dictionary would require several years’ work to create ab initio.
In all other cases, Microsoft has developed its lexical tools with the help of third party specialists in lexical data. These may be publishers with an established background in print-based reference publishing or newer independent software vendors (ISVs) who have built their businesses supplying lexical data to general software houses such as Microsoft.
Continuing our commitment to partnership
As we look to the future, Microsoft sees more, not fewer, opportunities for publishers to provide lexical data to work with our technologies:
Secondly, we are increasingly creating new opportunities for publishers to develop and market themselves additional products which integrate with our core applications. Examples of these already include the add-on spellchecker files for Office in areas such as law, medicine or economics, which exist today for Dutch, French and Italian Or again the Translation Dictionaries technical article (a form of software development kit) for the bilingual dictionaries in Office 2003: this article – which is available free-of-charge – enables publishers of bilingual dictionaries to adapt their existing content and sell it as a module which is fully integrated in Office (see http://www.microsoft.com/downloads/details.aspx?FamilyId=38934F90-FB06-4ABF-ABA5-94D16BF813BB&displaylang=en).
Dictionaries and other
lexical data remain a strategic investment for Microsoft, but one that we
believe is based on partnership, not exclusion, an opportunity for dictionary
publishers and not a threat.
Dictionary tools in Microsoft products
Microsoft Office 2003
Ø The speller, thesaurus and grammar checker are already an established part of the Proofing Tools in Office.
Ø It is possible to add support in more than 40 available languages.
Ø The Language Auto-Detect feature in Word automatically recognizes after a few words the language being used and will switch the speller, thesaurus and grammar checker to that language. Alternatively, the language itself can be specified.
Ø Words that are not included in the standard speller (e.g. specialized terms or company names) can be added to the user’s custom dictionary in each individual Office configuration.
Ø For French, Dutch and Italian, additional spellchecker files covering specialist vocabulary for science, law, medicine, IT and economics can be downloaded and integrated into the existing speller
Ø Access to a range of research and reference information is offered without leaving the Office application. The dictionaries available include:
· Encarta World English Dictionary, developed in association with Bloomsbury Publishing, with 100,000 headwords (US and UK versions)
· Encarta French Dictionary, built specifically for Microsoft by a development team in France, 45,000 headwords
· German monolingual dictionary, produced by a leading German dictionary publisher, with 57.000 headwords
· Bilingual dictionaries for English to and from several languages including French, German, Italian and Spanish
More to come
Ø In 2004-2005 Microsoft will be adding localized versions of Office for many languages, with spellchecking support provided in many cases. These new versions will extend its coverage of the languages of the new member states of the European Union (with additions such as Maltese) and beyond (for languages such as Macedonian and Afrikaans). Discussions are also under way to offer further specialized spellcheckers for other languages.
Encarta Reference Library 2005
Ø It is possible to consult a dictionary without opening up Office, the same dictionary content is available in the latest versions of Encarta, featuring one-click access to definitions, synonyms and translations (the exact mix varies by language).
Ø Encarta Reference Library 2005 is available for English (in US and UK editions), French, German, Spanish, and Dutch. For English, French and German, Microsoft offers the same dictionaries as in the Microsoft Office 2003 Research Service; for Spanish, the prestigious dictionary of the Real Academia de la Lengua Espanola is included, while for Dutch the dictionary is provided by Het Spectrum, publishers of the Prisma and Kramers dictionary ranges.
Dictionaries for Pocket PC
The Microsoft dictionaries available in MS Reader format can be downloaded directly to Pocket PC using Active Sync, including a specially shortened version of the Encarta World English Dictionary (with concise definitions in English) and bilingual dictionaries for English to and from French, German, Italian and Spanish: http://www.microsoft.com/reader/downloads/dictionaries.asp.
Ø It is possible to access the Encarta World English Dictionary on the Internet http://encarta.msn.com/encnet/features/dictionary/dictionaryhome.aspx.
Computer dictionaries in print
Ø Microsoft Computer Dictionary covers computer and IT terminology in English, over 10,000 entries. For details see: http://www.microsoft.com/mspress/southpacific/books/book19087.htm.
Ø The same in German:
Comment from Joseph J. Esposito:
“Mr. Parish does not respond to my piece at all. Microsoft’s intentions are irrelevant; what is important are its effects, an ineluctable outgrowth of Microsoft’s position in the marketplace. Mr. Parish’s illustrations all serve to confirm my thesis.”
K Dictionaries Ltd