dailyhaha.com
robots.txt
Robots Exclusion Standard data for dailyhaha.com
Resource Scan
Scan Details
Site Domain | dailyhaha.com |
Base Domain | dailyhaha.com |
Scan Status | Ok |
Last Scan | 2024-05-13T02:42:54+00:00 |
Next Scan | 2024-05-20T02:42:54+00:00 |
Last Scan
Scanned | 2024-05-13T02:42:54+00:00 |
URL | https://dailyhaha.com/robots.txt |
Redirect | https://www.dailyhaha.com/robots.txt |
Redirect Domain | www.dailyhaha.com |
Redirect Base | dailyhaha.com |
Domain IPs | 104.21.234.188, 104.21.234.189, 2606:4700:3038::6815:eabc, 2606:4700:3038::6815:eabd |
Redirect IPs | 104.21.234.188, 104.21.234.189, 2606:4700:3038::6815:eabc, 2606:4700:3038::6815:eabd |
Response IP | 104.21.234.188 |
Found | Yes |
Hash | 02d392c03c6d94415930cdbcbfd126c648d8eb434a4b2da1a682dfaf3bd4f478 |
SimHash | 285d1fc8a5f2 |
Groups
googlebot
Rule | Path |
---|---|
Disallow | /spon/ |
Disallow | /banners/ |
Disallow | /*.swf$ |
Disallow | /*.cgi$ |
Disallow | /*.jsp$ |
Disallow | /*.html$ |
Disallow | /*.info$ |
Disallow | /*.SITE%3D |
Disallow | /*.ws$ |
Disallow | /*SITE%3D |
Disallow | /*%3C |
*
Rule | Path |
---|---|
Disallow | /*.php* |
Disallow | /*.cgi* |
Disallow | /*.jsp* |
Disallow | /*.html* |
Disallow | /*.info* |
Disallow | /*.SITE%3D* |
Disallow | /*.ws* |
Comments