A simple web crawler and scraper for extracting data from websites. The crawler is built on Scrapy, and the scraper uses BeautifulSoup. The scraper currently extracts structured data only from Trustpilot.com; for all other sites it returns unstructured text.
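The unstructured-text fallback for non-Trustpilot sites could look roughly like the sketch below. This is not the project's code: `TextExtractor` and `extract_text` are hypothetical names, and the sketch deliberately swaps BeautifulSoup for the standard library's `html.parser` so that it runs without third-party dependencies. It only illustrates the idea of reducing a page to its visible text.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text from HTML, skipping <script> and <style> bodies.

    Hypothetical helper for illustration; not part of the project.
    """

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a skipped tag
        self._chunks = []     # non-empty text fragments, in document order

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only text that is outside skipped tags and not pure whitespace.
        if self._skip_depth == 0:
            stripped = data.strip()
            if stripped:
                self._chunks.append(stripped)

    def get_text(self):
        return "\n".join(self._chunks)


def extract_text(html: str) -> str:
    """Return the visible text of an HTML document, one fragment per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.get_text()


if __name__ == "__main__":
    sample = "<html><body><h1>Title</h1><script>var x = 1;</script><p>Body text</p></body></html>"
    print(extract_text(sample))
```

With BeautifulSoup, the comparable step is usually a call to `get_text()` on the parsed document after removing script and style elements. Whether the project does exactly that is an assumption here.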
[✅] extract_from_url API:
[✅] extract_from_trustpilot_url API:
Project link: https://github.com/sarapiscitelli/web-data-extraction