|
About the SEAlang Library Maguindanao Text Corpus |
This mononlingual corpus consists of Maguindanao texts extracted from: |
--
A 2.5MB (about 275,000 word) monolingual corpus prepared by the
SAY Project (2006-2008, Computing Research Laboratory, New Mexico State University.
Director: Jim Cowie).
The Maguindanao portion of bitext materials from the same project
are also added.
|
--
Additional materials were extracted from the dialogs of
Maguindanao Dialogs and Drills
(2009, Tenex Racman and R. David Zorc, edited by Jason Lobel).
Usage |
- context searches show how the search target appears in context, taking both leading and trailing collocates (or neighboring words) into account. This search returns a merged list of leading and trailing collocates. |
- collocate searches are better for focusing on the search target's immediate neighbor. This search returns separate lists of leading and trailing collocates. |
- merged view allows for fast switching between collocate and context views. Try brief first - downloaded pages may be very large, and a slow browser may fall behind in displaying the detailed view. The Go! button invokes the brief view. |
- raw contexts show the search word in context without any attempt at analysis or sanity-checking (local segmentation that helps ensure that a real word has been found). |
-
restrict collocates requires (or forbids) all collocates to have at least
one sense with a particular part of speech or usage.
Additional tips |
Because the underlying text corpus may be quite large, results may be taken from a random sample of hits. For common words, this means that sample contexts and exact collocate frequencies will vary from run to run. |
Clicking on a word/collocate with the mouse starts a new search:
yellow
searches for contexts, and
black
searches for collocates.
Copyright notices Maguindanao Dialogs and Drills are copyright 2009 McNeil Technologies, and is used by permission. |