gulaku.cc
robots.txt

Robots Exclusion Standard data for gulaku.cc

Resource Scan

Scan Details

Site Domain gulaku.cc
Base Domain gulaku.cc
Scan Status Ok
Last Scan 2025-10-07T16:06:50+00:00
Next Scan 2025-11-06T16:06:50+00:00

Last Scan

Scanned 2025-10-07T16:06:50+00:00
URL https://gulaku.cc/robots.txt
Domain IPs 104.21.79.233, 172.67.149.236, 2606:4700:3030::6815:4fe9, 2606:4700:3034::ac43:95ec
Response IP 104.21.79.233
Found Yes
Hash 0b2aef9fbf9f349b48de4f62a03292955bca148c0ba7c0f1c2680374f7036345
SimHash 69051895c5d1

Groups

googlebot
slurp
bingbot

Rule Path
Allow /

*

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gulaku.cc/sitemap.xml
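The groups and records above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not the scanned file itself: ROBOTS_TXT is an assumed reconstruction from the Groups and Other Records tables (the original bytes, and therefore the Hash shown above, are not reproduced here), and "examplebot" is a hypothetical user agent used only to exercise the * group. site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the scan's Groups / Other Records tables.
# The real file's exact casing, ordering, and whitespace are unknown.
ROBOTS_TXT = """\
User-agent: googlebot
User-agent: slurp
User-agent: bingbot
Allow: /

User-agent: *
Allow: /

User-agent: ia_archiver
Disallow: /

Sitemap: https://gulaku.cc/sitemap.xml
"""


def build_parser(text):
    """Parse robots.txt text in memory, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "examplebot" is a hypothetical agent that only matches the * group.
for agent in ("googlebot", "slurp", "bingbot", "ia_archiver", "examplebot"):
    allowed = parser.can_fetch(agent, "https://gulaku.cc/")
    print(f"{agent}: {'allowed' if allowed else 'disallowed'}")

print(parser.site_maps())
```

Under this reconstruction, only ia_archiver is refused; the named crawlers and any agent falling through to the * group are allowed everywhere.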