article scraper

FREEMIUM
By ODLICA | Updated one month ago | News, Media

Article Scraper API Documentation

Overview

The Article Scraper API extracts structured information from articles on arbitrary websites. The response is a list of data nodes, each representing one element of the article, such as an image, a block of text, or a heading.

Response Structure

The API response consists of the following key components:

  • 'data': An array with one entry per element of the article.

    Each element in the array includes the following fields:

      - 'group_heading': A heading associated with a group of nodes.
      - 'group_number': An identifier for the group.
      - 'is_node_heading', 'is_node_image', 'is_node_list': Flags indicating the type of node.
      - 'node_alt': Alternative text for image nodes.
      - 'node_content': Text content of text and heading nodes.
      - 'node_name': Name of the node.
      - 'node_src': Source URL for image nodes.
      - 'node_type': Type of the node (e.g., 'img', 'text', 'p', 'h1').
      - 'position_info': Information about the position and dimensions of the node.
    
  • 'status': Status of the API request (e.g., 'success' or 'error').

  • 'msg': Additional information or an error message.
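
As a rough illustration of the structure above, the sketch below checks the 'status' of an already-decoded response and groups its nodes by 'group_number'. The sample payload is invented for illustration: the field names come from this README, but the values, the value types (for example, whether 'group_number' is an integer) and the helper name nodes_by_group are assumptions.

```python
from collections import defaultdict

# Hypothetical sample payload: field names follow the README, values are made up.
sample_response = {
    "status": "success",
    "msg": "",
    "data": [
        {"group_number": 1, "group_heading": "Intro",
         "node_type": "h1", "node_content": "Intro"},
        {"group_number": 1, "group_heading": "Intro",
         "node_type": "p", "node_content": "First paragraph."},
        {"group_number": 2, "group_heading": "Details",
         "node_type": "img", "node_src": "https://example.com/a.png",
         "node_alt": "A chart"},
    ],
}


def nodes_by_group(response):
    """Return {group_number: [node, ...]}, raising if the request failed."""
    # Assumption: any status other than 'success' is treated as a failure.
    if response.get("status") != "success":
        raise RuntimeError(response.get("msg") or "request failed")
    groups = defaultdict(list)
    for node in response.get("data", []):
        groups[node.get("group_number")].append(node)
    return dict(groups)


groups = nodes_by_group(sample_response)
for number, nodes in groups.items():
    # Print the group id, its heading, and how many nodes it holds.
    print(number, nodes[0].get("group_heading"), len(nodes))
```

Raising on a non-success status keeps error handling in one place; callers that prefer to inspect 'msg' themselves could return it instead.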

Node Types

  1. Image Node (node_type: 'img'): Represents an image in the article. It carries the source URL ('node_src') and the alternative text ('node_alt').

  2. Text Node (node_type: 'text' or 'p'): Represents textual content in the article. The text itself is in the 'node_content' field.

  3. Heading Node (node_type: 'h1'): Represents a heading in the article. The heading text is in the 'node_content' field.
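
The node types above suggest a simple dispatch on 'node_type'. The following is a minimal sketch, assuming each node is a dict with the fields named in this README. The helper name render_node and the sample nodes are hypothetical, and treating only 'h1' as a heading mirrors the list above rather than any confirmed limit of the API.

```python
def render_node(node):
    """Render one node as a line of plain text, or None if it is not handled."""
    node_type = node.get("node_type")
    if node_type == "img":
        # Image nodes carry their URL in node_src and alt text in node_alt.
        alt = node.get("node_alt") or ""
        return f"[image: {alt}] {node.get('node_src')}"
    if node_type == "h1":
        # Heading text lives in node_content.
        return "# " + node.get("node_content", "")
    if node_type in ("text", "p"):
        # Text nodes also expose their text through node_content.
        return node.get("node_content", "")
    # Node types not described in this README are skipped.
    return None


# Hypothetical nodes for illustration only.
sample_nodes = [
    {"node_type": "h1", "node_content": "Title"},
    {"node_type": "p", "node_content": "Body text."},
    {"node_type": "img", "node_src": "https://example.com/a.png",
     "node_alt": "A chart"},
    {"node_type": "table"},
]

lines = [line for line in map(render_node, sample_nodes) if line is not None]
print("\n".join(lines))
```

Returning None for unknown types lets the caller decide whether to drop or log nodes the README does not describe.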
