boxmash.com
robots.txt

Robots Exclusion Standard data for boxmash.com

Resource Scan

Scan Details

Site Domain boxmash.com
Base Domain boxmash.com
Scan Status Ok
Last Scan2024-10-31T14:31:32+00:00
Next Scan 2024-11-07T14:31:32+00:00

Last Scan

Scanned2024-10-31T14:31:32+00:00
URL https://boxmash.com/robots.txt
Domain IPs 104.21.8.183, 172.67.157.165, 2606:4700:3030::ac43:9da5, 2606:4700:3035::6815:8b7
Response IP 172.67.157.165
Found Yes
Hash 8d0f0e316e938bfc616dc98b05aaeeb6f7e6998ace8fb5a9ec9e19895b18b7bb
SimHash 4918ca40d237

Groups

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google

Rule Path
Disallow

duggmirror

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /trackback/
Disallow /feed/
Disallow /comments/
Disallow /category/*/*
Disallow */trackback/
Disallow */feed/
Disallow */comments/
Disallow /*?
Allow /wp-content/uploads/

Other Records

Field Value
crawl-delay 120

Other Records

Field Value
sitemap http://www.boxmash.com/sitemap.xml

Comments

  • Google Image
  • Google AdSense
  • digg mirror
  • global