gidonline.cx
robots.txt

Robots Exclusion Standard data for gidonline.cx

Resource Scan

Scan Details

Site Domain gidonline.cx
Base Domain gidonline.cx
Scan Status Ok
Last Scan2025-06-12T02:48:27+00:00
Next Scan 2025-07-12T02:48:27+00:00

Last Scan

Scanned2025-06-12T02:48:27+00:00
URL https://gidonline.cx/robots.txt
Redirect https://gidonline.net/robots.txt
Redirect Domain gidonline.net
Redirect Base gidonline.net
Domain IPs 104.21.11.66, 172.67.165.83, 2606:4700:3035::ac43:a553, 2606:4700:3037::6815:b42
Redirect IPs 37.1.219.167
Response IP 37.1.219.167
Found Yes
Hash 4ce6bcda3d95fc52e7c4999c64b898f0070b50873422e8ab115982b9b84de40a
SimHash d10db0694531

Groups

*

Rule Path
Disallow /engine/go.php
Disallow /user/
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*do%3Ddownload
Disallow /*do%3Dgo
Disallow /search/

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

Warnings

  • `host` is not a known field.