JSONpedia

Freemium
By michelemostarda | Updated 22 days ago | Other
README

Read Wiki pages as JSON objects!

JSONpedia is a framework designed to simplify access to MediaWiki content by transforming everything into JSON. The framework provides a library, a REST service and CLI tools to parse, convert, enrich and store WikiText documents. To make the large volume of semi-structured MediaWiki content easier to consume, the converted JSON documents are stored in both ElasticSearch (providing advanced faceting support) and MongoDB (allowing distributed map/reduce tasks). JSONpedia also supports recursive template expansion and mapping to DBpedia. The framework was initially designed to extract linguistic resources from Wikipedia dumps and to enable massive data scraping; the present intent of the project is to implement a general-purpose infrastructure that enables multi-language Wikipedia data consumption for both researchers and industry.
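To give a feel for the kind of map/reduce task mentioned above, here is a minimal local sketch in plain Python. It does not use MongoDB or any JSONpedia API; the document shape (the "title" and "templates" keys) and the function names are assumptions made purely for illustration.

```python
from collections import Counter
from functools import reduce

# Hypothetical converted documents. The "title" and "templates" keys are
# illustrative assumptions, not the actual JSONpedia document layout.
DOCS = [
    {"title": "Page one", "templates": ["Infobox", "Cite web"]},
    {"title": "Page two", "templates": ["Cite web"]},
    {"title": "Page three", "templates": []},
]


def map_templates(doc):
    """Map step: emit a per-document count of template names."""
    return Counter(doc.get("templates", []))


def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right


def count_templates(docs):
    """Run the map step on every document, then fold the partial counts."""
    return reduce(reduce_counts, map(map_templates, docs), Counter())


if __name__ == "__main__":
    for name, total in count_templates(DOCS).most_common():
        print(name, total)
```

In a real deployment the map and reduce steps would run inside the data store rather than in a local loop; the sketch only shows the shape of the computation.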

In addition to the JSON representation of the page DOM, the response includes the following data:

  • Abstract
  • Sections Text
  • Section Titles
  • References (Wikimedia internal links)
  • Links (External links)
  • Tables
  • Templates
  • DBpedia Mappings
  • Freebase id
  • … much more to come!
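As a rough sketch of how a client might consume such a response, the Python snippet below parses a JSON document and pulls out a few of the fields listed above. The sample payload and every key name in it ("abstract", "sections", "references", "links", "templates") are hypothetical; the real response layout should be checked against the project documentation.

```python
import json

# Hypothetical response payload. The key names are assumptions made for
# illustration and are not taken from the JSONpedia documentation.
SAMPLE_RESPONSE = """
{
  "abstract": "Example abstract text.",
  "sections": [
    {"title": "History", "text": "First section text."},
    {"title": "Usage", "text": "Second section text."}
  ],
  "references": ["Page_A", "Page_B"],
  "links": ["http://example.org/one"],
  "templates": [{"name": "Infobox"}, {"name": "Cite web"}]
}
"""


def summarize_page(raw_json):
    """Return a small summary dict from a JSON-encoded page document."""
    doc = json.loads(raw_json)
    sections = doc.get("sections", [])
    templates = doc.get("templates", [])
    return {
        "abstract": doc.get("abstract", ""),
        "section_titles": [s.get("title", "") for s in sections],
        "internal_link_count": len(doc.get("references", [])),
        "external_link_count": len(doc.get("links", [])),
        "template_names": [t.get("name", "") for t in templates],
    }


if __name__ == "__main__":
    print(json.dumps(summarize_page(SAMPLE_RESPONSE), indent=2))
```

Using `dict.get` with defaults keeps the sketch tolerant of missing fields, which seems prudent given that the list of returned data is described as still growing.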

For further information please visit the official project website: http://jsonpedia.org
