Texts API: How To Use the API | RapidAPI

Texts

FREEMIUM
By apihood
Updated 2 months ago
Text Analysis
4.4/10
Popularity Score
410ms
Latency
100%
Success Rate

Texts API Overview

Providing lemmatizer for 10 languages (Bulgarian, Czech, English, Estonian, French, Hungarian, Romanian, Slovak, Slovene and Ukrainian) together with various term statistics like term frequency, term density, automated readability index…

Contact API Provider
star-blueRate

Description

Purpose of this API is to provide various text analytics tools.

We are starting with providing lemmatizer with automatic language detection for following languages:

  • Bulgarian (bg)
  • Czech (cs)
  • English (en)
  • Estonian (et)
  • French (fr)
  • Hungarian (hu)
  • Romanian (ro)
  • Slovak (sk)
  • Slovene (sl)
  • Ukrainian (uk)

We also provide various statistics like term frequency, term density, automated readability index, reading time estimate, speaking time estimate etc (please see example response in endpoints tab for more information)

The lemmatizer is based on Lemmagen project and is using lexicons derived from MULTEXT east free dictionaries published under Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license. The source dictionary contains triplets: wordform, lemma and morphosyntactic description (MSD). The triplets are used for generating lemmatization rules (Ripple Down Rules). The quality of the derived lexicons depends on the source triplets quality.

NOTE: The lemmatizer is not context aware. Text is first tokenized then each token is analyzed individually using Lemmagen and the lemma is returned.

Disclaimer

API Hood does not warrant that the provided data will be free from errors or omissions, because used lexicons have a different quality in terms of completeness and precision.

Endpoints

GET Lemmatizer

API exposes GET /lemmatizer endpoint with following parameters. See endpoints tab for interactive documentation and example response.

Parameter Required Default Type Description
text string Text for analysis
language detected string Language of the text, available values: "bg", "cs", "en", "et", "fr", "hu", "ro", "sk", "sl", "uk". When missing, API tries to detect language from the text.

POST Lemmatizer

API exposes POST /lemmatizer endpoint which accepts Content-Type: application/json body payload. See endpoints tab for interactive documentation and example response.

Parameter Required Default Type Description
text string Text for analysis
language detected string Language of the text, available values: "bg", "cs", "en", "et", "fr", "hu", "ro", "sk", "sl", "uk". When missing, API tries to detect language from the text.
lowercase false boolean Boolean flag if the text should be lowercased. Defaults to false
rules [] array[string] Array of strings in format ["wordform => lemma"] to override default behaviour from the lexicon

Changelog & Planned features

For changelog and planned features see Text Analysis API changelog.

Research purposes

We want to support researchers interested in the API use, so if you want to use the API for research purposes and need more requests, contact us for a custom plan and we will try to do our best to support your research.

Unlimited access

If you need a plan with unlimited requests, contact us for a custom plan.

Install SDK for (Node.js)Unirest

OAuth2 Authentication
Client ID
Client Secret
OAuth2 Authentication