webgiaxe.vn
robots.txt

Robots Exclusion Standard data for webgiaxe.vn

Resource Scan

Scan Details

Site Domain webgiaxe.vn
Base Domain webgiaxe.vn
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-03-19T23:27:13+00:00
Next Scan 2025-06-17T23:27:13+00:00

Last Successful Scan

Scanned2024-05-01T22:43:47+00:00
URL https://webgiaxe.vn/robots.txt
Domain IPs 104.21.49.172, 172.67.148.5, 2606:4700:3030::6815:31ac, 2606:4700:3033::ac43:9405
Response IP 104.21.49.172
Found Yes
Hash 6f12f12e833a32b36fadeb8eb09c11dd2dfa8da4c305d5ab6804948cf0ebd379
SimHash 2054d48345b1

Groups

*

Rule Path
Disallow /admin/

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

gigabot

Rule Path
Disallow /