corpus.io
robots.txt

Robots Exclusion Standard data for corpus.io

Resource Scan

Scan Details

Site Domain corpus.io
Base Domain corpus.io
Scan Status Ok
Last Scan2025-10-01T01:40:46+00:00
Next Scan 2025-10-31T01:40:46+00:00

Last Scan

Scanned2025-10-01T01:40:46+00:00
URL https://corpus.io/robots.txt
Domain IPs 85.13.142.231
Response IP 85.13.142.231
Found Yes
Hash 20b31a37c98de9817e97b1ed3112ee5b5747efa037843d6a0a8d5d44e3e34d60
SimHash 4900dc008db3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://corpus.io/wp-sitemap.xml