/.well-known/

Log In Sign Up

gulaku.cc
robots.txt

Robots Exclusion Standard data for gulaku.cc

Archived Snapshots

Resource Scan

Scan Details

Site Domain	gulaku.cc
Base Domain	gulaku.cc
Scan Status	Ok
Last Scan	2025-10-07T16:06:50+00:00
Next Scan	2025-11-06T16:06:50+00:00

Last Scan

Scanned	2025-10-07T16:06:50+00:00
URL	https://gulaku.cc/robots.txt
Domain IPs	104.21.79.233, 172.67.149.236, 2606:4700:3030::6815:4fe9, 2606:4700:3034::ac43:95ec
Response IP	104.21.79.233
Found	Yes
Hash	0b2aef9fbf9f349b48de4f62a03292955bca148c0ba7c0f1c2680374f7036345
SimHash	69051895c5d1

Groups

googlebot
slurp
bingbot

Rule

Path

Allow

/

*

Rule

Path

Allow

/

ia_archiver

Rule

Path

Disallow

/

Back to top

Other Records

Field

Value

sitemap

https://gulaku.cc/sitemap.xml

Back to top