tgdir.org
robots.txt

Robots Exclusion Standard data for tgdir.org

Resource Scan

Scan Details

Site Domain tgdir.org
Base Domain tgdir.org
Scan Status Ok
Last Scan2025-10-16T19:32:39+00:00
Next Scan 2025-10-23T19:32:39+00:00

Last Scan

Scanned2025-10-16T19:32:39+00:00
URL https://tgdir.org/robots.txt
Domain IPs 193.42.111.183
Response IP 193.42.111.183
Found Yes
Hash b395270c3709d926d4a5faffb96a183d7f70fb5c848e67c325f09ee4d153dcad
SimHash 510c44404e13

Groups

*

Rule Path
Allow /

dotbot

Rule Path
Disallow /

ahrefsbot/7.0

Rule Path
Disallow /

clark-crawler2/nutch-1.19-snapshot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot/7~bl

Rule Path
Disallow /

mj12bot/v1.4.8

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

dataforseobot/1.0

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

rogerbot/1.2

Rule Path
Disallow /

yeti

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://tgdir.org/sitemap.xml