About the SEAlang Library Malay Resources
SEAlang's Malay dictionary is primarily based on the works of Sir Richard James Wilkinson. |
--
A Malay-English Dictionary (romanised), 1932, Mytilene, Greece: Salavopoulos and Kinderlis. 1,288 pages,
26,000 plus heads, nearly 12,000 references, 21,500 etymological notes, 9,000 plus citations of authorities, 4,000
formal species references.
|
--
An Abridged Malay-English Dictionary (romanised), Third Edition, 1926, Singapore: Kelly & Walsh.
2,500 etymological notes, 11,000 heads, 3,500 derivatives, 4,000 compounds.
|
A less formal contemporary work is included to help ensure that modern spellings and abbreviations can be found: |
--
Bhanhot's Malay-English Dictionary is a pioneering (1996) effort by Dr. Diljeet Kumar Bhanot.
It contains roughly 6,000 heads and 3,500 subheads. A former chairman of the Malaysian Medical Association,
Selangor, Dr. Bhanot is in private practice; see www.bhanot.net.
|
Like other Austronesian languages, Malay consists mostly of
lemmas (that is, dictionary headwords) that have been prefixed, suffixed, or
reduplicated (lemmas, unlike roots, can generally stand alone).
Traditionally, all forms are listed under the lemma,
which can make it difficult to find unfamiliar words.
We use a variety of techniques to automatically find useful information, including this
rule-based lemmatiser
developed by Timothy Baldwin and Su'ad Awab.
A note on transcription and tagging of the "Unabridged" Normally the SEAlang Lab will individually tag all usable information, including compounds, references, loan sources, etc. Inconsistencies and idiosyncracies in the original publication (for example, abbreviations are often context-dependent, and thousands of Latin terms and loan sources are printed with spaces between each letter) made this impossible for the Wilkinson Unabridged. |
We have put our efforts into tagging the parts of the text that
seem most useful to the scholarly community, and provide entry-by-entry
links to the original text image to allow immediate checking of the transcription (please bring
any errors to our attention).
If anybody wishes to pursue developement of an edited, critical, or updated edition
of Wilkinson, we would be delighted to assist.
Wilkinson's Dictionaries Encyclopediaic in scope, Wilkinson's 1932 edition is the culmination of decades of work on Malay, which included a first 'unabridged' (of about half its size) published in 1901-1902. |
Wilkinson's orthography presents a slight problem. For much of the 20th century Wilkinson established the standard for romanization, enabling the transition from the previously universal Arabic-based Jawi script. A series of spelling reforms, culminating in the 1972 harmonization of Malay and Indonesian, has left Wilkinson slightly out of step with modern practice. As a rule, Wilkinson's ch = modern c, his ĕ (which reflected ə, and which you can enter with E) = e, and - (for hyphenation of affixes) is not used. |
We provide a checkbox (under Match) that allows for approximate matching of these.
However, other modern alternations (e.g. o/u and e/i) are not handled
automatically at this time; please contact us if you'd like to help gloss Wilkinson's spelling.
Display Please note the alternatives for display on the left: |
--
show self only shows the headword or derived form that matched (try beranak).
|
--
show self/parent shows the lemma of the (derived) form as well (try beranak).
|
--
show family shows the lemma and all derived forms (try anak).
Searching Searches may: |
--
match a single character with ?, and zero or more
with *. Thus, ca?ang matches cabang, cadang, canang, while ca*ang matches
cadangan, cawangan, campur tangan and many others.
|
--
require matches for both, or either, Indonesian text (of the headword),
or English text (in the definition).
|
--
be limited to a particular etymology, usage, type, or subject.
|
Predictive completion Dictionary heads will appear as you type. Short lists will include multi-word entries and examples (compare ban to band). |
Lemmatization and "smart" search To help locate phrases and derivatives, search will automatically: |
--
search examples for an instance of the word if it isn't found in a head (try anakan).
|
--
next, lemmatize the word, and search for the root.
|
--
In case of a multi-word phrase, we automatically search examples, then search
for the phrase as a list of separate words, then search for the list of roots.
|
The English search term can be expanded (default)
to include inflected forms (a search for sing matches
sings, singing, sang, sung as well).
Approximate search for Javanese means that we ignore diacritics,
glottal marks (the ' ), and hyphens in compounds.
|