in.gururu.tw
robots.txt

Robots Exclusion Standard data for in.gururu.tw

Resource Scan

Scan Details

Site Domain in.gururu.tw
Base Domain gururu.tw
Scan Status Ok
Last Scan 2025-09-09T06:28:57+00:00
Next Scan 2025-10-09T06:28:57+00:00

Last Scan

Scanned 2025-09-09T06:28:57+00:00
URL http://in.gururu.tw/robots.txt
Domain IPs 172.217.194.121, 2404:6800:4003:c01::79
Response IP 172.253.118.121
Found Yes
Hash ac535581a7cc43ea6e884b4f22f2e3847196e20be95af7a1883b0240270f4b23
SimHash 4914d8504f53
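
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan does not name the algorithm or say which bytes were hashed. The following is a minimal sketch, assuming SHA-256 over the raw response body, of how a fetched copy of the file could be compared with the reported value. The function names are hypothetical and not part of any scanner API.

```python
import hashlib

# Hash value reported in the scan above.
REPORTED_HASH = "ac535581a7cc43ea6e884b4f22f2e3847196e20be95af7a1883b0240270f4b23"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched body with a reported hash.

    Assumption: the reported hash is SHA-256 over the unmodified body bytes.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch does not necessarily mean the file changed; the scanner may normalize the body before hashing.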

Groups

mediapartners-google

Rule Path
Disallow (empty path)

*

Rule Path
Disallow /search
Disallow /share-widget
Allow /

Other Records

Field Value
sitemap http://in.gururu.tw/sitemap.xml
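
To illustrate how the groups and the sitemap record above behave in practice, the sketch below rebuilds a robots.txt file from the scanned rules and evaluates it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption (the scan does not show the original file's exact casing, ordering, or comments), and `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the scanned groups and records.
ROBOTS_TXT = """\
User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /search
Disallow: /share-widget
Allow: /

Sitemap: http://in.gururu.tw/sitemap.xml
"""

BASE = "http://in.gururu.tw"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser()

# An empty Disallow places no restriction on mediapartners-google.
adsense_search = parser.can_fetch("mediapartners-google", BASE + "/search?q=test")

# Any other crawler (hypothetical "ExampleBot") falls under the "*" group.
bot_search = parser.can_fetch("ExampleBot", BASE + "/search?q=test")
bot_widget = parser.can_fetch("ExampleBot", BASE + "/share-widget")
bot_home = parser.can_fetch("ExampleBot", BASE + "/")

# site_maps() is available in Python 3.8 and later.
sitemaps = parser.site_maps()
```

Under these assumptions, `adsense_search` and `bot_home` are `True`, while `bot_search` and `bot_widget` are `False`. Note that `urllib.robotparser` applies the first matching rule in file order rather than the longest match; with the Disallow lines listed before `Allow: /`, both strategies give the same result here.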