Knowledge-Driven Multilingual Text Analysis and Transparent Information Retrieval
Language Technology for Industrial Applications
Springer
ISBN 978-3-031-91740-0
Standardpreis
Bibliografische Daten
Fachbuch
Buch. Hardcover
2025
1 s/w-Abbildung, 166 Farbabbildungen.
In englischer Sprache
Umfang: xi, 394 S.
Format (B x L): 15,5 x 23,5 cm
Verlag: Springer
ISBN: 978-3-031-91740-0
Weiterführende bibliografische Daten
Das Werk ist Teil der Reihe: Cognitive Technologies
Produktbeschreibung
To do so, the author investigates all text analysis components required for multilingual indexing, starting from preparatory work like language and topic identification, continuing with sentence splitting and tokenization (including Chinese), and describing lexical analysis, also for multiword entries and Named Entities. Entries are then disambiguated both on syntactic (by a tagger) and semantic level (by multilingual word sense disambiguation). The analysis results are collected in a dynamic multilingual ConceptNet, which is an index structure extended by monolingual relations (like synonyms, or head-modifier links) as well as multilingual ones (translations). In addition to many European languages also Turkish, Arabic, Persian, and Chinese are treated.
The book concludes with a description of components needed to build the required resources, like crawlers, bilingual term extraction, and tools for defaulting linguistic annotations. For each component, readers will find a technology overview, a discussion of its main challenges in computational treatment, a description of the technical solution selected, and evaluation information.
Autorinnen und Autoren
Produktsicherheit
Hersteller
Springer Nature Customer Service Center GmbH
ProductSafety@springernature.com