whatthedata.cc
robots.txt

Robots Exclusion Standard data for whatthedata.cc

Resource Scan

Scan Details

Site Domain whatthedata.cc
Base Domain whatthedata.cc
Scan Status Ok
Last Scan2026-02-08T16:42:55+00:00
Next Scan 2026-02-22T16:42:55+00:00

Last Scan

Scanned2026-02-08T16:42:55+00:00
URL https://whatthedata.cc/robots.txt
Domain IPs 104.21.83.65, 172.67.215.161, 2606:4700:3031::6815:5341, 2606:4700:3032::ac43:d7a1
Response IP 104.21.83.65
Found Yes
Hash 7f90a3c57d60e5c1b39a13304c5084b9083e0e79a5ce267a8cd7c386393f2d1a
SimHash 495094534733

Groups

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://whatthedata.cc/sitemap/sitemap-index.xml

Warnings

  • `host` is not a known field.