funkyspacemonkey.com
robots.txt

Robots Exclusion Standard data for funkyspacemonkey.com

Resource Scan

Scan Details

Site Domain funkyspacemonkey.com
Base Domain funkyspacemonkey.com
Scan Status Ok
Last Scan 2024-11-17T23:26:34+00:00
Next Scan 2024-11-24T23:26:34+00:00

Last Scan

Scanned 2024-11-17T23:26:34+00:00
URL https://funkyspacemonkey.com/robots.txt
Domain IPs 89.41.38.22
Response IP 89.41.38.22
Found Yes
Hash 2c66d3cd8f6f895da6255a2c696ca7487fa99599f548f115fe78d8a636bc2a08
SimHash 6c186793f691

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /z/j/
Disallow /z/c/
Disallow /stats/
Disallow /dh_
Disallow /about/
Disallow /contact/
Disallow /tag/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /contact
Disallow /manual
Disallow /manual/*
Disallow /phpmanual/
Disallow /category/
Allow /ads.txt

googlebot

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
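
The googlebot rules above use the `*` wildcard and the `$` end-of-URL anchor. As a rough illustration of how such patterns can be evaluated, here is a minimal sketch that translates each pattern into a regular expression. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and the sketch assumes `*` means "any run of characters" and `$` means "end of the path plus query string"; it does not model Allow rules or precedence between groups.

```python
import re

# Disallow patterns copied from the googlebot group listed above.
GOOGLEBOT_DISALLOW = ["/*.php$", "/*.inc$", "/*.gz$", "/*.wmv$", "/*.cgi$", "/*.xhtml$"]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern with * and $ into a compiled regex.

    Hypothetical helper: assumes '*' matches any run of characters and a
    trailing '$' anchors the pattern to the end of the URL path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and rejoin them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any googlebot Disallow pattern matches the path."""
    # re.match anchors at the start of the string, like a robots.txt prefix rule.
    return any(pattern_to_regex(p).match(path) is not None for p in GOOGLEBOT_DISALLOW)


if __name__ == "__main__":
    for candidate in ["/index.php", "/index.php?x=1", "/sitemap.xml.gz", "/2024/some-post/"]:
        print(candidate, is_disallowed(candidate))
```

Because of the `$` anchor, a path such as `/index.php?x=1` is not matched by `/*.php$` under these assumptions, while `/index.php` is.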

duggmirror

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

*

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /
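
To see how the groups above affect a crawler, the following sketch feeds a hand-reconstructed subset of the rules (part of the first `*` group plus the duggmirror group) to Python's standard `urllib.robotparser`. The reconstructed text and the `ExampleBot` user agent are assumptions for illustration, not the site's actual file. Note that `urllib.robotparser` does plain prefix matching in file order and does not interpret `*` or `$` inside paths, so its answers can differ from how Google evaluates the wildcard rules.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of a subset of the groups listed above;
# the real robots.txt contains more rules and more groups.
ROBOTS_SUBSET = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /stats/
Disallow: /tag/
Disallow: /wp-admin/
Disallow: /category/
Allow: /ads.txt

User-agent: duggmirror
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_SUBSET.splitlines())

BASE = "https://funkyspacemonkey.com"

# "ExampleBot" is a hypothetical crawler that falls through to the "*" group.
admin_allowed = parser.can_fetch("ExampleBot", BASE + "/wp-admin/")
tag_allowed = parser.can_fetch("ExampleBot", BASE + "/tag/apple/")
ads_allowed = parser.can_fetch("ExampleBot", BASE + "/ads.txt")

# duggmirror has its own group that disallows the whole site.
dugg_allowed = parser.can_fetch("duggmirror", BASE + "/ads.txt")

if __name__ == "__main__":
    print("ExampleBot /wp-admin/ :", admin_allowed)
    print("ExampleBot /tag/apple/:", tag_allowed)
    print("ExampleBot /ads.txt   :", ads_allowed)
    print("duggmirror /ads.txt   :", dugg_allowed)
```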

Other Records

Field Value
sitemap http://www.funkyspacemonkey.com/sitemap.xml.gz
sitemap http://cdn.attracta.com/sitemap/925481.xml.gz
sitemap https://www.funkyspacemonkey.com/sitemap_index.xml
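
The sitemap records above can also be read programmatically. This sketch assumes the three records appear in the file as `Sitemap:` lines and uses `RobotFileParser.site_maps()` (available in Python 3.8 and later) to list them, then collects the distinct hosts they point at.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Assumed form of the sitemap records in the file, rebuilt from the table above.
SITEMAP_LINES = [
    "Sitemap: http://www.funkyspacemonkey.com/sitemap.xml.gz",
    "Sitemap: http://cdn.attracta.com/sitemap/925481.xml.gz",
    "Sitemap: https://www.funkyspacemonkey.com/sitemap_index.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns the Sitemap URLs in file order, or None if there are none.
sitemaps = parser.site_maps() or []

# Distinct hosts the sitemap URLs point at, sorted for stable output.
hosts = sorted({urlparse(url).netloc for url in sitemaps})

if __name__ == "__main__":
    for url in sitemaps:
        print(urlparse(url).scheme, url)
    print("hosts:", hosts)
```

Listing the scheme and host this way makes it easy to spot that two of the three records use plain `http` and that one points at a third-party host.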

Comments

  • disallow all files in these directories
  • disallow all files ending with these extensions
  • disable duggmirror
  • allow google image bot to search all images
  • allow adsense bot on entire site
  • BEGIN XML-SITEMAP-PLUGIN
  • END XML-SITEMAP-PLUGIN