jasonsantamaria.com
robots.txt

Robots Exclusion Standard data for jasonsantamaria.com

Resource Scan

Scan Details

Site Domain jasonsantamaria.com
Base Domain jasonsantamaria.com
Scan Status Ok
Last Scan2024-10-05T03:10:05+00:00
Next Scan 2024-11-04T03:10:05+00:00

Last Scan

Scanned2024-10-05T03:10:05+00:00
URL https://jasonsantamaria.com/robots.txt
Domain IPs 13.251.96.10, 18.139.194.139
Response IP 13.215.144.61
Found Yes
Hash aefd09a09bff4d03c76569ed73f1203f9dd6ad43a9f1b77c100dd32900d43c57
SimHash fa5f51518757

Groups

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

Comments

  • AI Data Scraper
  • https://darkvisitors.com/agents/anthropic-ai
  • AI Data Scraper
  • https://darkvisitors.com/agents/bytespider
  • AI Data Scraper
  • https://darkvisitors.com/agents/ccbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/diffbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/facebookbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/google-extended
  • AI Data Scraper
  • https://darkvisitors.com/agents/gptbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/omgili
  • AI Data Scraper
  • https://darkvisitors.com/agents/applebot-extended