vicesnob.com
robots.txt

Robots Exclusion Standard data for vicesnob.com

Resource Scan

Scan Details

Site Domain vicesnob.com
Base Domain vicesnob.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-02-01T15:35:49+00:00
Next Scan 2026-04-02T15:35:49+00:00

Last Successful Scan

Scanned2025-12-02T06:55:42+00:00
URL https://vicesnob.com/robots.txt
Domain IPs 104.21.12.239, 172.67.153.238, 2606:4700:3030::6815:cef, 2606:4700:3035::ac43:99ee
Response IP 104.21.12.239
Found Yes
Hash eff3d5cb40598cd9a7e0f0d4feeaef4e361cb6754f5aa4ddaef2a4a4925546e3
SimHash 2c6e58127278

Groups

bruqibot

Rule Path
Disallow /

dmca-bot

Rule Path
Disallow /

mj12bot

Product Comment
mj12bot Known for aggressive scraping
Rule Path
Disallow /

*

Rule Path Comment
Disallow /*?comments -
Disallow /wp-comments-post.php -
Disallow /wp-admin/* Critical security fix
Disallow /wp-login.php* -
Disallow /wp-includes/ -
Disallow /tag/ -
Disallow /author/ -
Disallow /cdn-cgi/ -
Disallow /*/feed/ -
Disallow /*/comments/ -
Disallow /*?s= -
Disallow /*?is_otto_page_fetch= -
Disallow /*?p= -
Disallow /*?currency= -
Disallow /*?fbclid= -
Disallow /*?doing_wp_cron= -
Disallow /license.txt -
Disallow /readme.html Prevent WordPress version leaks

Other Records

Field Value
sitemap https://www.vicesnob.com/sitemap_index.xml

Comments

  • BLOCK COPYRIGHT TROLL BOTS FIRST
  • UNIVERSAL RULES
  • SITEMAPS (keep at bottom)