twopendata.com
robots.txt
Robots Exclusion Standard data for twopendata.com
Resource Scan
Scan Details
Site Domain | twopendata.com |
Base Domain | twopendata.com |
Scan Status | Ok |
Last Scan | 2025-08-25T15:38:54+00:00 |
Next Scan | 2025-09-01T15:38:54+00:00 |
Last Scan
Scanned | 2025-08-25T15:38:54+00:00 |
URL | https://www.twopendata.com/robots.txt |
Domain IPs | 104.21.77.25, 172.67.203.223, 2606:4700:3036::6815:4d19, 2606:4700:3036::ac43:cbdf |
Response IP | 104.21.77.25 |
Found | Yes |
Hash | 08eea924abdd050e6002d07b29cf70dc8634b744869f0b616db5b43667427048 |
SimHash | e926ee5a53d1 |
Groups
*
Rule | Path |
---|---|
Allow | /*?PageSpeed=noscript |
Disallow | /d/ |
Disallow | /e/ |
Disallow | /*?* |
Disallow | /*.php |
Disallow | /cdn-cgi/ |
Disallow | /cache/ |
Disallow | /law/ |
Disallow | /laws/ |
Disallow | /ecachefiles/ |
Disallow | /template/ |
Disallow | /templates/ |
Disallow | /so/ |
Disallow | /keyword/ |
Disallow | /php/ |
Disallow | /docs/ |
Disallow | /templates/ |
Disallow | /about.html |
Disallow | /privacy.html |