nutrivene.com
robots.txt

Robots Exclusion Standard data for nutrivene.com

Resource Scan

Scan Details

Site Domain nutrivene.com
Base Domain nutrivene.com
Scan Status Ok
Last Scan2025-11-13T00:54:33+00:00
Next Scan 2025-12-13T00:54:33+00:00

Last Scan

Scanned2025-11-13T00:54:33+00:00
URL https://nutrivene.com/robots.txt
Domain IPs 192.124.249.12
Response IP 192.124.249.12
Found Yes
Hash a31f726efe0eeaf09c95a460fb181c15245c8b7e31ea14b78e3e7a22dd235183
SimHash 6d684b5442b9

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins
Disallow /wp-content/wfcache/
Disallow /wp-content/upgrade/
Disallow /comments/feed/
Disallow /category/
Disallow /tag/
Disallow /archives/
Disallow /cgi-bin/
Disallow /*?*
Disallow *?replytocom

googlebot-image

Rule Path
Allow /wp-content/uploads/
Allow /wp-content/themes/

mediapartners-google*

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap http://www.example.com/sitemap_index.xml

Comments

  • Disallow all files in these directories
  • Disallow WordPress core crawl
  • Disallow WordPress taxonomy crawl
  • Disallow server & qQuerystring crawl
  • Allow WordPress content crawl
  • Allow Google crawl