intechnews.com
robots.txt

Robots Exclusion Standard data for intechnews.com

Resource Scan

Scan Details

Site Domain intechnews.com
Base Domain intechnews.com
Scan Status Ok
Last Scan2025-12-13T18:40:15+00:00
Next Scan 2026-01-12T18:40:15+00:00

Last Scan

Scanned2025-12-13T18:40:15+00:00
URL https://intechnews.com/robots.txt
Domain IPs 104.21.48.241, 172.67.157.3, 2606:4700:3033::ac43:9d03, 2606:4700:3035::6815:30f1
Response IP 172.67.157.3
Found Yes
Hash e289706993d32223d586fbbef2b978b5e8512f1efd89b9b22f203cc9fec75b2d
SimHash 61198951c412

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

youbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://intechnews.com/sitemap.xml