itpathsolutions.com
robots.txt

Robots Exclusion Standard data for itpathsolutions.com

Resource Scan

Scan Details

Site Domain itpathsolutions.com
Base Domain itpathsolutions.com
Scan Status Ok
Last Scan2026-03-12T04:12:48+00:00
Next Scan 2026-04-11T04:12:48+00:00

Last Scan

Scanned2026-03-12T04:12:48+00:00
URL https://itpathsolutions.com/robots.txt
Redirect https://www.itpathsolutions.com/robots.txt
Redirect Domain www.itpathsolutions.com
Redirect Base itpathsolutions.com
Domain IPs 104.21.23.234, 172.67.214.36, 2606:4700:3030::ac43:d624, 2606:4700:3037::6815:17ea
Redirect IPs 104.21.23.234, 172.67.214.36, 2606:4700:3030::ac43:d624, 2606:4700:3037::6815:17ea
Response IP 172.67.214.36
Found Yes
Hash 458f69ab8487cc877e9887e1f0815819003fde513bbecee51ce388fe9e24f28c
SimHash 507ed140c56a

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claudebot

Rule Path
Allow /

google-extended

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

ccbot

Rule Path
Disallow /

amazonbot

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

dataforseobot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://itpathsolutions.com/sitemap_index.xml

Comments

  • OpenAI (ChatGPT / GPTBot)
  • OpenAI's WebCrawler (used for ChatGPT Search / oai)
  • Anthropic (Claude)
  • Google (Gemini / Bard / AI Overviews)
  • Perplexity.ai
  • Scale AI (data labeling)
  • Amazon (Titan / Alexa Teacher Model)
  • AppleBot (Siri / Apple LLM training)
  • Common scrapers to block