techearl.com
robots.txt

Robots Exclusion Standard data for techearl.com

Resource Scan

Scan Details

Site Domain techearl.com
Base Domain techearl.com
Scan Status Ok
Last Scan 2026-01-13T06:08:00+00:00
Next Scan 2026-01-20T06:08:00+00:00

Last Scan

Scanned 2026-01-13T06:08:00+00:00
URL https://techearl.com/robots.txt
Domain IPs 104.21.37.5, 172.67.202.39, 2606:4700:3030::ac43:ca27, 2606:4700:3035::6815:2505
Response IP 104.21.37.5
Found Yes
Hash 05e698a6cb5c5bb19244c29893f368527f1ca4e2f1b4a10a30eeaba35b9dc199
SimHash 51144941c191
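
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a fetched robots.txt body could be fingerprinted and compared against the recorded value as shown here. The function names `fingerprint` and `has_changed` are hypothetical, and the sample body is a placeholder, not the actual file content.

```python
import hashlib

# Digest recorded in the scan report (algorithm ASSUMED to be SHA-256).
RECORDED_HASH = "05e698a6cb5c5bb19244c29893f368527f1ca4e2f1b4a10a30eeaba35b9dc199"


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt body."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Report whether the body's digest differs from a recorded digest."""
    return fingerprint(body) != recorded.lower()


if __name__ == "__main__":
    # Placeholder body for illustration only; it is not the real file.
    sample = b"User-agent: *\nAllow: /\n"
    print(fingerprint(sample))
    print(has_changed(sample))
```

A byte-exact digest changes on any edit, including whitespace, so it only answers whether the file differs from the last scan, not by how much.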

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

semanticbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://techearl.com/sitemap.xml
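
The groups and sitemap record above can be checked programmatically. The following is a rough sketch using Python's standard `urllib.robotparser`. The robots.txt text is rebuilt from the report's groups, so it is an approximation and not the verbatim file (original casing, ordering, and comments are unknown). The helper names `build_robots_txt`, `make_parser`, and `is_allowed`, and the `/example-page` path, are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens listed with "Disallow /" in the report's Groups section.
BLOCKED_AGENTS = [
    "gptbot", "chatgpt-user", "google-extended", "ccbot", "ia_archiver",
    "perplexitybot", "claudebot", "omgilibot", "facebookbot", "applebot",
    "anthropic-ai", "bytespider", "claude-web", "diffbot", "imagesiftbot",
    "omgili", "youbot", "cohere-ai", "dataforseobot", "semanticbot",
    "barkrowler",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the groups in the report."""
    groups = ["User-agent: *\nAllow: /"]
    groups += [f"User-agent: {agent}\nDisallow: /" for agent in BLOCKED_AGENTS]
    groups.append("Sitemap: https://techearl.com/sitemap.xml")
    return "\n\n".join(groups) + "\n"


def make_parser() -> RobotFileParser:
    """Parse the rebuilt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


def is_allowed(user_agent: str,
               url: str = "https://techearl.com/example-page") -> bool:
    """Return True if the rebuilt rules let user_agent fetch url."""
    return make_parser().can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "Googlebot"):
        print(agent, is_allowed(agent))
```

Under these rules the named crawlers fall into their own `Disallow /` groups, while any agent without a matching group falls back to the `*` group and is allowed. Python's parser matches agent tokens by case-insensitive substring, so its answers may differ slightly from a crawler that follows a stricter matching rule.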