geneticliteracyproject.org
robots.txt

Robots Exclusion Standard data for geneticliteracyproject.org

Resource Scan

Scan Details

Site Domain geneticliteracyproject.org
Base Domain geneticliteracyproject.org
Scan Status Ok
Last Scan2024-10-01T13:47:39+00:00
Next Scan 2024-10-08T13:47:39+00:00

Last Scan

Scanned2024-10-01T13:47:39+00:00
URL https://geneticliteracyproject.org/robots.txt
Domain IPs 104.26.12.32, 104.26.13.32, 172.67.75.101
Response IP 172.67.75.101
Found Yes
Hash 226265a89b4f457a48e5e7fd40cb3478648e0f1e88a43540503cbe4fa0afff0e
SimHash e426dc5a8422

Groups

ahrefsbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://geneticliteracyproject.org/sitemap_index.xml
sitemap https://geneticliteracyproject.org/news-sitemap.xml

Comments

  • Commented out as per ticket 12417773
  • User-agent: *
  • Added per Ticket 12417773
  • LKnight