Given the example URL, letโs set up a hypothetical request to extract information from an Amazon product page for the Universal Web Scraper. This walkthrough will show how to collect details such as the product name, price, description, and customer reviews.
{
"target_url": "https://www.amazon.com/Quencher-FlowState-Stainless-Insulated-Smoothie/dp/B0CP9Z56SW",
"keys_list": [
{ "key_to_extract": "product_name", "description_of_key": "name of the product" },
{ "key_to_extract": "price", "description_of_key": "price of the product" },
{ "key_to_extract": "description", "description_of_key": "product description" },
{ "key_to_extract": "customer_reviews", "description_of_key": "customer reviews of the product" }
]
}
baseUrl/parser
endpoint with the above JSON payload.baseUrl/parser/:id/:timestamp
to track the progress and eventually retrieve the processed data.Upon completion, the service would provide a JSON response containing the extracted data. While I cannot fetch real-time data or interact with external systems directly, based on the typical structure of an Amazon product page, you could expect something like this:
{
"id": "####-####-####-####",
"timestamp": 123456,
"status": "Complete",
"processed_data": {
"product_name": "Quencher FlowState Stainless Insulated Smoothie Tumbler",
"price": "$35.99",
"description": "Experience the perfect blend of style and functionality with our Quencher FlowState Tumbler, ideal for smoothies, coffee, and more.",
"customer_reviews": [
"Absolutely love this tumbler! Keeps my drinks cold for hours.",
"Stylish and practical - a must-have for anyone on the go!"
]
}
}
This example showcases how you can effectively utilize the Universal Web Scraper to standardize information from e-commerce sites like Amazon into a structured JSON format, aiding in data analysis and decision-making processes.