Deep Categorization is MeaningCloud's solution for in-depth rule-based categorization. It assigns one or more categories to a text, using a very detailed rule-based language that allows you to identify very specific scenarios and patterns using a combination of morphological, semantic and text rules.
The Document Structure Analysis extracts different sections of a given document with markup content (which includes formatted documents such as PDF or Microsoft Word files), including the title, headings, abstract and parts of an email. This process, even though it takes into account some language markers, is based mainly in the markup of the document, so it can be applied to documents in any language.
This service provides detailed linguistic information for a given text in English, Spanish, French, Italian, Portuguese and Catalan. There are three operating modes that cover different aspects of the morphosyntactic and semantic analysis: Lemmatization, which provides the lemmas of the different words in a text; PoS tagging: which provides not only the grammatical category of a word, including semantic information about that word; Syntactic analysis: that provides a thorough syntactic analysis, giving a complete syntactic tree where the leaves represent the most basic elements and their morphological and semantic analyses.