chegae.com
robots.txt

Robots Exclusion Standard data for chegae.com

Resource Scan

Scan Details

Site Domain chegae.com
Base Domain chegae.com
Scan Status Ok
Last Scan 2025-11-03T23:10:49+00:00
Next Scan 2025-12-03T23:10:49+00:00
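
The two timestamps above imply a rescan interval, which can be checked with a short sketch. The report does not state the scanner's schedule, so the 30-day figure is only what these two values work out to, and the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601 with UTC offset).
last_scan = datetime.fromisoformat("2025-11-03T23:10:49+00:00")
next_scan = datetime.fromisoformat("2025-12-03T23:10:49+00:00")

# Difference between the scheduled and the completed scan.
interval = next_scan - last_scan
print(interval.days)
```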

Last Scan

Scanned 2025-11-03T23:10:49+00:00
URL https://chegae.com/robots.txt
Domain IPs 104.21.29.48, 172.67.171.103, 2606:4700:3030::ac43:ab67, 2606:4700:3032::6815:1d30
Response IP 104.21.29.48
Found Yes
Hash b7c2474592127d0eba2bbd943c23ac8c6a9b76d6d178f33fea29e2f14ab743f3
SimHash aa124804e9d9
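
The Hash value is 64 hexadecimal digits, which is consistent with a SHA-256 digest. The report does not name the algorithm or say whether the raw response bytes are hashed, so the sketch below assumes both. The helper names `sha256_hex` and `unchanged` are hypothetical.

```python
import hashlib

# Hash value copied from the Last Scan details above.
REPORTED_HASH = "b7c2474592127d0eba2bbd943c23ac8c6a9b76d6d178f33fea29e2f14ab743f3"

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

def unchanged(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Assumption: the scanner hashes the raw robots.txt response body."""
    return sha256_hex(body) == reported.lower()
```

A caller that has fetched https://chegae.com/robots.txt could pass the response body to `unchanged` to see whether the file still matches this scan, provided the assumptions above hold.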

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /author/
Disallow /tag/
Disallow /comments/
Disallow /page/
Disallow /2010/
Disallow /2011/
Disallow /cgi-bin/
Disallow /wp-content/
Disallow /*/feed*
Disallow /?s=
Disallow /*.js$
Disallow /*.css$
Disallow /*.cgi$
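
Several of the rules above use the `*` wildcard and the `$` end anchor. Python's standard `urllib.robotparser` matches paths by plain prefix and does not interpret these characters, so the sketch below translates each pattern into a regular expression instead. It is a minimal illustration, assuming the common interpretation in which `*` matches any run of characters and a trailing `$` anchors the end of the URL path; the function names are hypothetical. Because the group contains no Allow rules, no precedence handling is needed.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/wp-admin/", "/author/", "/tag/", "/comments/", "/page/",
    "/2010/", "/2011/", "/cgi-bin/", "/wp-content/",
    "/*/feed*", "/?s=", "/*.js$", "/*.css$", "/*.cgi$",
]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regular expression."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)

for path in ["/wp-admin/options.php", "/blog/feed/", "/assets/app.js",
             "/assets/app.js?v=2", "/2012/hello/", "/?s=query"]:
    print(path, is_disallowed(path))
```

Under these assumptions `/assets/app.js` is blocked while `/assets/app.js?v=2` is not, because the `$` anchor requires the path to end in `.js`.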

Other Records

Field Value
sitemap http://chegae.com/sitemap.xml.gz