dailyhaha.com
robots.txt

Robots Exclusion Standard data for dailyhaha.com

Resource Scan

Scan Details

Site Domain dailyhaha.com
Base Domain dailyhaha.com
Scan Status Ok
Last Scan2024-05-13T02:42:54+00:00
Next Scan 2024-05-20T02:42:54+00:00

Last Scan

Scanned2024-05-13T02:42:54+00:00
URL https://dailyhaha.com/robots.txt
Redirect https://www.dailyhaha.com/robots.txt
Redirect Domain www.dailyhaha.com
Redirect Base dailyhaha.com
Domain IPs 104.21.234.188, 104.21.234.189, 2606:4700:3038::6815:eabc, 2606:4700:3038::6815:eabd
Redirect IPs 104.21.234.188, 104.21.234.189, 2606:4700:3038::6815:eabc, 2606:4700:3038::6815:eabd
Response IP 104.21.234.188
Found Yes
Hash 02d392c03c6d94415930cdbcbfd126c648d8eb434a4b2da1a682dfaf3bd4f478
SimHash 285d1fc8a5f2

Groups

googlebot

Rule Path
Disallow /spon/
Disallow /banners/
Disallow /*.swf$
Disallow /*.cgi$
Disallow /*.jsp$
Disallow /*.html$
Disallow /*.info$
Disallow /*.SITE%3D
Disallow /*.ws$
Disallow /*SITE%3D
Disallow /*%3C

*

Rule Path
Disallow /*.php*
Disallow /*.cgi*
Disallow /*.jsp*
Disallow /*.html*
Disallow /*.info*
Disallow /*.SITE%3D*
Disallow /*.ws*

ia_archiver

Rule Path
Disallow /

Comments

  • robots.txt file for http://www.dailyhaha.com/
  • 9/6/2012
  • end of file