thdis.com
robots.txt

Robots Exclusion Standard data for thdis.com

Resource Scan

Scan Details

Site Domain thdis.com
Base Domain thdis.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2026-01-12T21:00:50+00:00
Next Scan 2026-03-13T21:00:50+00:00

Last Successful Scan

Scanned2025-10-22T20:02:52+00:00
URL https://thdis.com/robots.txt
Domain IPs 18.156.88.174, 3.124.161.162, 3.74.190.245
Response IP 3.124.161.162
Found Yes
Hash 393300ae4ad225e78f06af1081db384d0ac904335f5fd3ffdd97fad075a6c0c7
SimHash 3c1d9e0369cb

Groups

*

Rule Path
Allow /
Disallow */login
Disallow */registration
Disallow *page%3D-*
Disallow *page%3D0*
Disallow *?page=-*
Disallow *?page=0*

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

googlebot

Rule Path
Allow /
Disallow *page%3D-*
Disallow *page%3D0*

googlebot-image

Rule Path
Allow /
Disallow *page%3D-*
Disallow *page%3D0*

googlebot-mobile

Rule Path
Allow /
Disallow *page%3D-*
Disallow *page%3D0*

bingbot

Rule Path
Allow /
Disallow *page%3D-*
Disallow *page%3D0*

applebot

Rule Path
Allow /
Disallow *page%3D-*
Disallow *page%3D0*

yandexbot

Rule Path
Allow /
Disallow *page%3D-*
Disallow *page%3D0*

duckduckbot

Rule Path
Allow /
Disallow *page%3D-*
Disallow *page%3D0*

Other Records

Field Value
sitemap https://thdis.com/sitemap.xml

Comments

  • High-volume SEO crawlers - Rate limit
  • AI Bot Control - Restrict training on content
  • Social Media Bots
  • Always welcome Google, Bing, and other mainstream search engines