sgclark.com
robots.txt

Robots Exclusion Standard data for sgclark.com

Resource Scan

Scan Details

Site Domain sgclark.com
Base Domain sgclark.com
Scan Status Ok
Last Scan 2026-03-18T20:45:55+00:00
Next Scan 2026-03-25T20:45:55+00:00

Last Scan

Scanned 2026-03-18T20:45:55+00:00
URL https://sgclark.com/robots.txt
Domain IPs 34.174.23.105
Response IP 34.174.23.105
Found Yes
Hash e5cfd755b7619a0a692449db726d7a3507e72d8978b83ed71beaffc55a5eac20
SimHash e85d1d508fd6

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Disallow /wp-admin/

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.sgclark.com/sitemap_index.xml
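
The groups and sitemap record listed above can be checked against sample user agents with Python's standard urllib.robotparser module. The following is a minimal sketch, not the site's actual file: the robots.txt text is reconstructed from the groups in this report, so its ordering and the omission of comments are assumptions, and the helper names (BLOCKED_AGENTS, build_robots_txt, make_parser) and the sample agent "ExampleBrowser" are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Agents that this report lists with a single "Disallow: /" rule.
BLOCKED_AGENTS = [
    "bytespider", "ccbot", "diffbot", "facebookbot", "google-extended",
    "gptbot", "omgili", "anthropic-ai", "claude-web", "claudebot", "cohere-ai",
]


def build_robots_txt() -> str:
    """Rebuild a robots.txt body from the groups shown in the report.

    Assumption: the live file may order groups differently and also
    contains comments, which are left out here.
    """
    lines = [
        "User-agent: *", "Allow: /", "",
        "User-agent: googlebot", "Disallow: /wp-admin/", "",
    ]
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append("Sitemap: https://www.sgclark.com/sitemap_index.xml")
    return "\n".join(lines)


def make_parser() -> RobotFileParser:
    """Parse the reconstructed text locally, without any network request."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    samples = [
        ("GPTBot", "https://sgclark.com/"),
        ("ClaudeBot", "https://sgclark.com/"),
        ("Googlebot", "https://sgclark.com/wp-admin/"),
        ("Googlebot", "https://sgclark.com/"),
        ("ExampleBrowser", "https://sgclark.com/wp-admin/"),
    ]
    for agent, url in samples:
        verdict = "allowed" if parser.can_fetch(agent, url) else "blocked"
        print(f"{agent:15} {url:32} {verdict}")
    print("sitemaps:", parser.site_maps())
```

An agent that matches a named group is evaluated against that group only, so the wildcard Allow rule applies just to agents with no group of their own. The site_maps() method requires Python 3.8 or later.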

Comments

  • Group 1
  • Group 2
  • Dark Visitors Robots.txt
  • AI Data Scraper
  • https://darkvisitors.com/agents/bytespider
  • AI Data Scraper
  • https://darkvisitors.com/agents/ccbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/diffbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/facebookbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/google-extended
  • AI Data Scraper
  • https://darkvisitors.com/agents/gptbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/omgili
  • Undocumented AI Agent
  • https://darkvisitors.com/agents/anthropic-ai
  • Undocumented AI Agent
  • https://darkvisitors.com/agents/claude-web
  • Undocumented AI Agent
  • https://darkvisitors.com/agents/claudebot
  • Undocumented AI Agent
  • https://darkvisitors.com/agents/cohere-ai