w3techs.com
robots.txt

Robots Exclusion Standard data for w3techs.com

Resource Scan

Scan Details

Site Domain w3techs.com
Base Domain w3techs.com
Scan Status Ok
Last Scan2024-09-14T18:20:35+00:00
Next Scan 2024-09-21T18:20:35+00:00

Last Scan

Scanned2024-09-14T18:20:35+00:00
URL https://w3techs.com/robots.txt
Domain IPs 88.99.136.18
Response IP 88.99.136.18
Found Yes
Hash 6751fcd75a75f52594ba37940a17ebc04d105fcf5b9444de1547085bc476bae4
SimHash 10155917c5d0

Groups

ia_archiver

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /