gaygalls.net
robots.txt

Robots Exclusion Standard data for gaygalls.net

Resource Scan

Scan Details

Site Domain gaygalls.net
Base Domain gaygalls.net
Scan Status Ok
Last Scan5/16/2025, 8:01:06 AM
Next Scan 6/15/2025, 8:01:06 AM

Last Scan

Scanned5/16/2025, 8:01:06 AM
URL https://gaygalls.net/robots.txt
Domain IPs 104.21.67.165, 172.67.178.124, 2606:4700:3036::ac43:b27c, 2606:4700:3037::6815:43a5
Response IP 104.21.67.165
Found Yes
Hash c1e9d3607d16b5f9fc1606af8fe57c138fbd1af7e250b7c9daa4e8a194e7b624
SimHash c1029a3545f5

Groups

*

Rule Path
Allow /
Disallow /debug
Disallow /*/debug
Disallow /admin
Disallow /admin*
Disallow /jmx
Disallow /jmx*
Disallow /cgi-bin
Disallow /account/not_my_account
Disallow /wp-content/uploads/users/*/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap http://gaygalls.net/sitemap.xml

Comments

  • robots.txt
  • Wait 1 second between successive requests. See ONBOARD-2698 for details.

Warnings

  • `host` is not a known field.