scrapingdog.com
robots.txt

Robots Exclusion Standard data for scrapingdog.com

Resource Scan

Scan Details

Site Domain scrapingdog.com
Base Domain scrapingdog.com
Scan Status Ok
Last Scan2025-11-17T06:13:50+00:00
Next Scan 2025-11-24T06:13:50+00:00

Last Scan

Scanned2025-11-17T06:13:50+00:00
URL https://scrapingdog.com/robots.txt
Domain IPs 104.26.0.42, 104.26.1.42, 172.67.68.209, 2606:4700:20::681a:12a, 2606:4700:20::681a:2a, 2606:4700:20::ac43:44d1
Response IP 104.26.0.42
Found Yes
Hash 22de52cf9b6dabed095e807e2cd9a4e34469b6d9fb5586b6eb3265cd471384c9
SimHash 2da8935aa791

Groups

*

Rule Path Comment
Disallow /wp-admin/ -
Allow /wp-admin/admin-ajax.php -
Disallow /serpapi-alternative/ Block this alternative page

Other Records

Field Value
sitemap https://www.scrapingdog.com/sitemap_index.xml

Comments

  • \ /
  • (o o) ==> Get Fresh Public Data at Scale using Scrapingdog suite of APIs
  • ___/| |\___
  • /___/ \___\
  • This file is created by Divanshu
  • LinkedIn: https://www.linkedin.com/in/divanshu-khatter-20a7a711a/
  • Block specific "alternative" pages
  • Sitemap here