




Contact:
nemlar@hum.ku.dk


BLARK Definition and BLARK Content
- for written language


The degree to which the modules are needed is marked by plus signs:
+++ means essential
++ means very important
+ means important.

HLT modules and corresponding written language resources, marked with importance:

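| HLT modules / Resources | Monolingual lexicon | Multi-/bilingual lexicon | Proper names | Thesauri, ontologies, wordnets | Unannotated corpora | Annotated corpora | Parallel multilingual corpora | Multimodal corpora for (hand) OCR | Multimodal corpora for (typed) OCR |
|---|---|---|---|---|---|---|---|---|---|
| Morphological comp. (infl., deriv., stemm., diacritic, ...) | +++ | | | | | ++ | | | |
| Morphological comp. (statistical) | + | | | | | +++ | | | |
| POS disambiguator/tagger | +++ | | ++ | | | | | | |
| POS (statistical) | + | | ++ | | | | | | |
| Diacritizer | +++ | | ++ | ++ | | | | | |
| Diacritizer (statistical) | | | | | | +++ | | | |
| Sentence Boundary Detection (punctuation) | +++ | | | | | ++ | | | |
| Sentence Boundary (statistical) | | | | | | +++ | | | |
| Named Entity Recognition | +++ | | +++ | | | + | | | |
| Named Entity (statistical) | | | | | | +++ | | | |
| Word Sense Disambig. | +++ | | | | ++ | ++ | | | |
| Word Sense Disambig. (statistical) | | | | | | +++ | | | |
| Term extraction | +++ | | | | +++ | | | | |
| Term extraction (statistical) | | | | | +++ | +++ | | | |
| Shallow parsing | +++ | | | | | | | | |
| Shallow parsing (statistical) | | | | | | +++ | | | |
| Syntactic analysis comp. | +++ | | | | | + | | | |
| Syntactic Anal. comp. (statistical) | | | | | | +++ | | | |
| Semantic Anal. comp. (incl. coref. res.) | +++ | | | +++ | | | | | |
| Sentence synthesis and generation | +++ | | | ++ | + | ++ | | | |
| Transfer tool (Software) | | +++ | | | | | | | |
| Transfer tool (statistical) | | | | | | | +++ | | |
| Alignment | +++ | +++ | | | | | + | | |
| Alignment (statistical) | | | | | | | +++ | | |
| Grapheme recognition, typed OCR (statistical) | ++ | | | | +++ | | | | +++ |
| Grapheme recognition, handwritten OCR (statistical) | ++ | | | | +++ | | | +++ | |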


Written language applications and corresponding HLT modules, marked with importance:

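| HLT modules / Applications | Document production | Summarization | Classification | Indexing | IE | IR/filtering | MAT | MT | ASR Dictation | TTS | Dialog Systems |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Morphological comp. (infl., deriv., stemm., diacritic, ...) | +++ | +++ | +++ | +++ | +++ | ++ | +++ | +++ | ++ | +++ | +++ |
| POS disambiguator/tagger | +++ | +++ | ++ | +++ | +++ | ++ | +++ | +++ | ++ | +++ | +++ |
| Diacritizer | | | | | | | | | | +++ | |
| Sentence Boundary Detection (punctuation) | +++ | +++ | | | +++ | ++ | +++ | +++ | | ++ | ++ |
| Named Entity Recognition | +++ | +++ | +++ | +++ | +++ | +++ | +++ | +++ | | + | + |
| Word Sense Disambig. | | +++ | +++ | ++ | +++ | ++ | ++ | +++ | | +++ | +++ |
| Term extraction | + | +++ | +++ | +++ | +++ | +++ | +++ | +++ | | | ++ |
| Shallow parsing | ++ | ++ | + | | ++ | + | +++ | +++ | | ++ | +++ |
| Syntactic analysis comp. | ++ | | | | ++ | | ++ | +++ | | + | +++ |
| Semantic Analysis (incl. coref. res.) | | ++ | ++ | + | +++ | ++ | ++ | ++ | | + | +++ |
| Sentence synthesis and generation | ++ | +++ | | | | | ++ | +++ | | | +++ |
| Transfer tool (Software) | | | | | | | | +++ | | | |
| Alignment | | | | | | | ++ | | | | |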
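
The two tables can be read together: an application points to the modules it needs, and each module points to the resources it needs. The following is a minimal Python sketch of that lookup, not part of the BLARK itself. It assumes a numeric encoding of the plus signs (3 for +++, 2 for ++, 1 for +), and the names `MODULE_RESOURCES`, `APPLICATION_MODULES`, and `resources_for` are hypothetical. Only a few rows and two application columns (MT and MAT) are transcribed from the tables for illustration.

```python
# Numeric encoding of the plus-sign scale (an assumption of this sketch).
ESSENTIAL, VERY_IMPORTANT, IMPORTANT = 3, 2, 1

# A few rows transcribed from the module/resource table (not the full table).
MODULE_RESOURCES = {
    "Named Entity Recognition": {
        "Monolingual lexicon": ESSENTIAL,
        "Proper names": ESSENTIAL,
        "Annotated corpora": IMPORTANT,
    },
    "Term extraction": {
        "Monolingual lexicon": ESSENTIAL,
        "Unannotated corpora": ESSENTIAL,
    },
    "Semantic analysis": {
        "Monolingual lexicon": ESSENTIAL,
        "Thesauri, ontologies, wordnets": ESSENTIAL,
    },
    "Transfer tool (software)": {
        "Multi-/bilingual lexicon": ESSENTIAL,
    },
    "Alignment": {
        "Monolingual lexicon": ESSENTIAL,
        "Multi-/bilingual lexicon": ESSENTIAL,
        "Parallel multilingual corpora": IMPORTANT,
    },
}

# A few cells transcribed from the application/module table (MT and MAT columns).
APPLICATION_MODULES = {
    "MT": {
        "Named Entity Recognition": ESSENTIAL,
        "Term extraction": ESSENTIAL,
        "Semantic analysis": VERY_IMPORTANT,
        "Transfer tool (software)": ESSENTIAL,
    },
    "MAT": {
        "Named Entity Recognition": ESSENTIAL,
        "Term extraction": ESSENTIAL,
        "Alignment": VERY_IMPORTANT,
    },
}


def resources_for(application, min_level=ESSENTIAL):
    """Return the sorted resources needed by an application.

    A module counts only if the application rates it at min_level or higher,
    and a resource counts only if that module rates it at min_level or higher.
    """
    needed = set()
    for module, module_level in APPLICATION_MODULES[application].items():
        if module_level < min_level:
            continue
        for resource, resource_level in MODULE_RESOURCES.get(module, {}).items():
            if resource_level >= min_level:
                needed.add(resource)
    return sorted(needed)


if __name__ == "__main__":
    print(resources_for("MT"))
    print(resources_for("MAT", min_level=IMPORTANT))
```

Lowering `min_level` widens the result: at the "important" level, the MAT lookup in this excerpt also picks up annotated corpora and parallel multilingual corpora.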


MEDAR is supported by the European Commission's ICT programme and runs from February 1st 2008 until July 31st 2010.
