Home About Nooj
About Nooj

 

NooJ is a freeware, linguistic engineering development environment used to formalize various types of textual phenomena (orthography, lexical and productive morphology, local, structural and transformational syntax) using a large gamut of computational devices (from Finite-State Automata to Augmented Recursive Transition Networks). NooJ includes tools to construct, test, debug, maintain and accumulate large sets linguistic resources, and can apply them to large texts.
Modules for a fifteen languages are already available for free download: Arabic, Armenian, Bulgarian, Catalan, Chinese, English, French, Hebrew, Hungarian, Italian, Polish, Portuguese and Spanish. A dozen other modules are under construction.

 

NooJ's most exclusive characteristics are:

* NooJ can process texts and corpora in over 100+ file formats, including HTML, PDF, MS-OFFICE, all variants of UNICODE, ASCII, etc. It can import information from, and export its annotations back to XML documents.
* NooJ's linguistic engine uses an annotation system that allows all levels of grammars to be applied to texts without modifying them; this allows linguists to formalize various phenomena independently, and to apply the corresponding grammars in cascade.

For instance, by combining inflection, derivation and syntactic data, NooJ can perform Harris-type transformations.
NooJ is used as a linguistic engineering development platform, a corpus processor, an information extraction system, a terminological extractor, an Machine Translation development tool as well as to teach linguistics and computational linguistics.

To learn more about NooJ, download the software, linguistic resources, manual, tutorials and reference papers: www.nooj4nlp.net.

 

sponsoring

mshe.jpg
laseldi.jpg
isims.jpg
isi.jpg
1miracl.jpg

On line

We have 1 guest online