SYSTRAN.io platform is a collection of APIs for Translation, Multilingual Dictionary lookups, Natural Language Processing (Entity recognition, Morphological analysis, Part of Speech tagging, Language Identification…) and Text Extraction (from documents, audio files or images).
SYSTRAN Platform - API Collection for Translation and Natural Language Processing
SYSTRAN Platform enables you to utilize and analyze both structured and unstructured multilingual content, such as user-generated content, social media, Web content and more. Easy to use, scalable and reliable, the new SYSTRAN Platform brings the power of SYSTRAN's best-of-breed language processing technologies to your apps and websites.
SYSTRAN Platform is a collection of REST APIs, Client Libraries and samples for Text extraction, Translation, and Natural Language Processing.
SYSTRAN Platform is free for small volumes and testing purposes; monthly subscriptions are available for higher volumes.
You can try a basic subset of the features here, or sign up to SYSTRAN Platform for full-featured access (including uploading files for translation or speech recognition): https://platform.systran.net
SYSTRAN machine translation
Automatically translates documents from one language to another.
Automatically identifies which language documents are written in, through specific word- or sentence-sample detection.
Named Entity Recognition
Based on the analysis of the document contents, automatically recognizes and displays person names, locations, numbers, dates, organization names, …
Segmentation and Tokenization
Segments text into sentences and sentences into "tokens" (minimal processing units).
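To make the two steps concrete, here is a minimal sketch of sentence segmentation followed by tokenization. It is a naive regex-based illustration, not SYSTRAN's implementation or API: the function names `segment_sentences` and `tokenize` are hypothetical, and the splitting rules (sentence ends at `.`, `!` or `?` followed by whitespace) are simplifying assumptions that will mis-handle abbreviations and languages without spaces.

```python
import re


def segment_sentences(text):
    """Naively split text into sentences (hypothetical helper).

    Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Split a sentence into tokens (hypothetical helper).

    A token is either a run of word characters or a single
    punctuation character.
    """
    return re.findall(r"\w+|[^\w\s]", sentence)


if __name__ == "__main__":
    for sentence in segment_sentences("Hello, world. How are you?"):
        print(sentence, "->", tokenize(sentence))
```

A production segmenter needs language-specific rules; the sketch only shows the shape of the output: a list of sentences, each mapped to a list of tokens.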
Transcribes words or entities between languages that use different scripts.
Provides morphological analysis for individual words, returning the list of possible lemmas and parts of speech for an inflected form.
Upload and edit your dictionaries
Search within SYSTRAN multilingual dictionaries to obtain translations with additional contextual information such as frequency of meanings, domains and contexts, expressions and examples. The search can also be done within your own dictionaries.
Corpus Management and fuzzy matches
Upload and edit your corpus
Searches your corpus and returns exact and fuzzy matches of a given sentence
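The idea of exact and fuzzy matching can be sketched as follows. This is an illustration only, assuming a character-level similarity ratio from Python's standard `difflib` module; it is not the scoring method the platform uses. The function name `search_corpus` and the `threshold` value of 0.7 are hypothetical choices.

```python
from difflib import SequenceMatcher


def search_corpus(query, corpus, threshold=0.7):
    """Return (sentence, score, kind) tuples for corpus sentences similar to query.

    Hypothetical helper: 'kind' is "exact" when the sentence equals the
    query and "fuzzy" otherwise. Similarity is difflib's ratio in [0, 1],
    computed on lowercased text.
    """
    results = []
    for sentence in corpus:
        score = SequenceMatcher(None, query.lower(), sentence.lower()).ratio()
        if score >= threshold:
            kind = "exact" if sentence == query else "fuzzy"
            results.append((sentence, round(score, 3), kind))
    # Best matches first.
    results.sort(key=lambda r: r[1], reverse=True)
    return results


if __name__ == "__main__":
    corpus = [
        "The cat sat on the mat.",
        "The cat sat on a mat.",
        "Completely unrelated text here.",
    ]
    for match in search_corpus("The cat sat on the mat.", corpus):
        print(match)
```

Lowering the threshold returns more distant matches; translation-memory tools typically expose a similar cutoff, although how they compute the score varies.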
Extracts text from various document formats for processing by other modules and rebuilds the document with the modified or annotated text.
Supported formats: txt, html, htm, xhtml, xml, tmx, xliff, xlf, docx, pptx, xlsx, rtf, odp, ods, odt, json, po, ts, properties, resx, aspx, android xml
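Before uploading a file, a client might check its extension against the list above. The following is a small sketch under that assumption: the helper `is_supported_document` is hypothetical and not part of any SYSTRAN client library, and "android xml" is treated as covered by the `xml` extension because it is a file type rather than an extension.

```python
from pathlib import Path

# Extensions copied from the supported-formats list above.
# Assumption: "android xml" files carry the ordinary .xml extension.
SUPPORTED_EXTENSIONS = {
    "txt", "html", "htm", "xhtml", "xml", "tmx", "xliff", "xlf",
    "docx", "pptx", "xlsx", "rtf", "odp", "ods", "odt", "json",
    "po", "ts", "properties", "resx", "aspx",
}


def is_supported_document(filename):
    """Return True if the file extension is in the supported list (hypothetical helper)."""
    extension = Path(filename).suffix.lower().lstrip(".")
    return extension in SUPPORTED_EXTENSIONS


if __name__ == "__main__":
    for name in ("report.DOCX", "strings.xml", "photo.png"):
        print(name, is_supported_document(name))
```

The check is based on the file name only; it says nothing about whether the file content is actually valid for that format.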
Converts spoken words from audio files into written text.
Supported audio file formats: AAC, AIFF, ASF, FLAC, MS-Wave, MPEG, Ogg/Vorbis, Nist Sphere, Sun AU
Optical Character Recognition
Extracts text from images and scanned documents.