toolbox.com
robots.txt

Robots Exclusion Standard data for toolbox.com

Resource Scan

Scan Details

Site Domain toolbox.com
Base Domain toolbox.com
Scan Status Ok
Last Scan2025-12-12T07:04:44+00:00
Next Scan 2025-12-19T07:04:44+00:00

Last Scan

Scanned2025-12-12T07:04:44+00:00
URL https://toolbox.com/robots.txt
Domain IPs 141.193.213.10, 141.193.213.11
Response IP 141.193.213.10
Found Yes
Hash e0fcf26bb3d7c5405700213b21b73410db5edccb223ac0e39c98a8384408923a
SimHash 65517911e681

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow

slurp

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

twitterbot/1.0

Rule Path
Disallow

twitterbot 1.0

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow

ahrefssiteaudit

Rule Path
Disallow

screaming frog

Rule Path
Disallow

screaming frog seo spider

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

bingbot

Rule Path
Disallow

*

Rule Path
Disallow /
Disallow /cgi-bin/
Disallow /research/
Disallow /*?
Disallow /sw-marketing/*

*

Rule Path
Disallow /legal-resource-center/*
Disallow /offers/*
Disallow /cdn-cgi/

amazonbot
anthropic-ai
applebot
applebot-extended
bytespider
ccbot
claudebot
claude-web
cohere-ai
diffbot
facebookbot
gptbot
httrack
nutch
offline explorer
omgili
scrapy
youbot
meta-externalagent
timpibot
duckassistbot
perplexity-user
ai2bot-dolma
meta-externalfetcher
chatgpt-user
perplexitybot
oai-searchbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.spiceworks.com/sitemap_index.xml

Warnings

  • 1 invalid line.