captainverify.com
robots.txt

Robots Exclusion Standard data for captainverify.com

Resource Scan

Scan Details

Site Domain captainverify.com
Base Domain captainverify.com
Scan Status Ok
Last Scan2024-10-19T19:06:42+00:00
Next Scan 2024-11-18T19:06:42+00:00

Last Scan

Scanned2024-10-19T19:06:42+00:00
URL https://captainverify.com/robots.txt
Domain IPs 151.80.162.136
Response IP 151.80.162.136
Found Yes
Hash 91bf4102302bb97a66f1a45d3d3c6084534961eacc20e243deb305ec7e215d28
SimHash 50145971c0b5

Groups

*

Rule Path
Allow /*

amazonbot
applebot
anthropic-ai
blexbot
bytespider
ccbot
chatgpt-user
claude-web
cohere-ai
crazywebcrawler-spider
diffbot
etaospider
facebookbot
friendlycrawler
imagesiftbot
google-extended
gptbot
omgili
omgilibot
meltwater
perplexitybot
shopwiki
sosospider
seekr
textbulkerbot
twengabot-2.0
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://captainverify.com/sitemap-en.xml
sitemap https://captainverify.com/sitemap-de.xml
sitemap https://captainverify.com/sitemap-es.xml
sitemap https://captainverify.com/sitemap-fr.xml
sitemap https://captainverify.com/sitemap-it.xml
sitemap https://captainverify.com/sitemap-pt.xml