ipwatchdog.com
robots.txt

Robots Exclusion Standard data for ipwatchdog.com

Resource Scan

Scan Details

Site Domain ipwatchdog.com
Base Domain ipwatchdog.com
Scan Status Ok
Last Scan2024-04-29T06:35:42+00:00
Next Scan 2024-05-29T06:35:42+00:00

Last Scan

Scanned2024-04-29T06:35:42+00:00
URL https://ipwatchdog.com/robots.txt
Domain IPs 162.159.136.54, 162.159.137.54, 2606:4700:7::a29f:8836, 2606:4700:7::a29f:8936
Response IP 162.159.136.54
Found Yes
Hash 59458605d7c83b0f93e8ef11c533d6d8bf2f26a283dda267a1a09ae8e13798c5
SimHash 20156713f6c5

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /trackback
Disallow /comments
Disallow */trackback
Disallow */comments
Allow /wp-content/uploads
Disallow /*?*
Disallow /*?
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

Other Records

Field Value
sitemap https://ipwatchdog.com/sitemap.xml.gz

Comments

  • disallow all files with ? in url
  • disallow all files ending with these extensions
  • allow google image bot to search all images
  • allow Google adsense bot on entire site
  • BEGIN XML-SITEMAP-PLUGIN
  • END XML-SITEMAP-PLUGIN