Page 233 - DCAP310_INTRODUCTION_TO_ARTIFICIAL_INTELLIGENCE_AND_EXPERT_SYSTEMS
P. 233
Unit 12: Natural Language Processing
(i.e., declension and conjugation). This involves considering a set of tags describing grammatical Notes
categories of the word form concerned, most notably, its base form (lemma) and paradigm.
Automatic analysis of word forms in free text can be used for instance in grammar checker
development, and can aid corpus tagging, or semi-automatic dictionary compiling.
12.4.1 Semantic Analysis
Semantic and pragmatic analysis make up the most complex phase of language processing as
they build up on results of all the above mentioned disciplines. Based on the knowledge about
the structure of words and sentences, the meaning of words, phrases, sentences and texts is
stipulated, and subsequently also their purpose and consequences. From the computational
point of view, no general solutions that would be adequate have been proposed for this area.
There are many open theoretical problems, and in practice, great problems are caused by errors
on lower processing levels. The ultimate touchstone on this level is machine translation, which
hasn’t been implemented for Czech with satisfactory results yet.
One of the long-term projects of the NLP laboratory is the use of Transparent Intensional Logic
(TIL) as a semantic representation of knowledge and subsequently as a transfer language in
automatic machine translation. At the current stage, it is realistic to process knowledge in a
simpler form – considerably less complex tasks have been addressed, such as machine translation
for a restricted domain (e.g., official documents and weather reports), or semi-automatic machine
translation between close languages.
!
Caution The resources exploited in these applications are corpora, semantic nets, and
electronic dictionaries.
In linguistics, c to the level of the writing as a whole, to their language-independent meanings.
It also involves removing features specific to particular linguistic and cultural contexts, to the
extent that such a project is possible. The elements of idiom and figurative speech, being cultural,
are often also converted into relatively invariant meanings in semantic analysis.
12.4.2 Syntactic Analysis
Parsing or syntactic analysis is the process of analysing a string of symbols, either in natural
language or in computer languages, according to the rules of a formal grammar. The term
“parsing” comes from Latin “pars”, meaning part (of speech).
The term has slightly different meanings in different branches of linguistics and computer
science. Traditional sentence parsing is often performed as a method of understanding the exact
meaning of a sentence, sometimes with the aid of devices such as sentence diagrams. It usually
emphasizes the importance of grammatical divisions such as subject and predicate.
Within computational linguistics the term is used to refer to the formal analysis by computer of
a sentence or other string of words into its constituents, resulting in a parse tree showing their
syntactic relation to each other, which may also contain semantic and other information.
Did u know? The term “syntactic analysis” is also used in psycholinguistics when
describing language comprehension. In this context, parsing refers to the way that human
beings analyze a sentence or phrase (in spoken language or text) “in terms of grammatical
constituents, identifying the parts of speech, syntactic relations, etc.” This term is especially
common when discussing what linguistic cues help speakers to interpret garden-path
sentences.
LOVELY PROFESSIONAL UNIVERSITY 227