hdgat.com
robots.txt

Robots Exclusion Standard data for hdgat.com

Resource Scan

Scan Details

Site Domain hdgat.com
Base Domain hdgat.com
Scan Status Ok
Last Scan2025-04-10T13:11:36+00:00
Next Scan 2025-05-10T13:11:36+00:00

Last Scan

Scanned2025-04-10T13:11:36+00:00
URL https://hdgat.com/robots.txt
Domain IPs 104.21.71.132, 172.67.170.148, 2606:4700:3035::6815:4784, 2606:4700:3036::ac43:aa94
Response IP 172.67.170.148
Found Yes
Hash d333abfd55cbc72368331b99f325ecb551ddaca434e7aafb5e9a93922f268191
SimHash 525fc8426bbb

Groups

*

Rule Path
Disallow /view$
Disallow /view?
Disallow /t/
Disallow /s/

bingbot

Rule Path
Disallow /view$
Disallow /view?
Disallow /t/
Disallow /s/

Other Records

Field Value
crawl-delay 3

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

special_archiver

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

queryseekerspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

paracrawl

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /