gemajitu.cc
robots.txt

Robots Exclusion Standard data for gemajitu.cc

Resource Scan

Scan Details

Site Domain gemajitu.cc
Base Domain gemajitu.cc
Scan Status Ok
Last Scan2025-10-15T03:49:08+00:00
Next Scan 2025-11-14T03:49:08+00:00

Last Scan

Scanned2025-10-15T03:49:08+00:00
URL https://gemajitu.cc/robots.txt
Domain IPs 104.21.28.118, 172.67.145.254, 2606:4700:3031::ac43:91fe, 2606:4700:3035::6815:1c76
Response IP 104.21.28.118
Found Yes
Hash 145a028a15960849011617faa7a05f46ffa040518c14468f073c6fca3c4169fa
SimHash 4f4881474731

Groups

googlebot
slurp
bingbot

Rule Path
Allow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://gemajitu.com/data-sitemap.xml

Warnings

  • `host` is not a known field.