atiusamy.com
robots.txt

Robots Exclusion Standard data for atiusamy.com

Resource Scan

Scan Details

Site Domain atiusamy.com
Base Domain atiusamy.com
Scan Status Ok
Last Scan2024-04-21T22:48:36+00:00
Next Scan 2024-05-21T22:48:36+00:00

Last Scan

Scanned2024-04-21T22:48:36+00:00
URL https://atiusamy.com/robots.txt
Redirect https://www.atiusamy.com/robots.txt
Redirect Domain www.atiusamy.com
Redirect Base atiusamy.com
Domain IPs 104.21.49.49, 172.67.188.250, 2606:4700:3031::6815:3131, 2606:4700:3035::ac43:bcfa
Redirect IPs 104.21.49.49, 172.67.188.250, 2606:4700:3031::6815:3131, 2606:4700:3035::ac43:bcfa
Response IP 104.21.49.49
Found Yes
Hash c6067800a0514cfe61ee57206e0acf8ad9e90a76eb5e8ab10e39cdc330b3c841
SimHash ea1f51418457

Groups

turnitinbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

Comments

  • Dark Visitors robots.txt
  • AI Data Scraper
  • https://darkvisitors.com/agents/anthropic-ai
  • AI Data Scraper
  • https://darkvisitors.com/agents/bytespider
  • AI Data Scraper
  • https://darkvisitors.com/agents/ccbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/facebookbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/google-extended
  • AI Data Scraper
  • https://darkvisitors.com/agents/gptbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/omgili