curiousarchive.com
robots.txt

Robots Exclusion Standard data for curiousarchive.com

Resource Scan

Scan Details

Site Domain curiousarchive.com
Base Domain curiousarchive.com
Scan Status Ok
Last Scan 2024-10-28T02:08:37+00:00
Next Scan 2024-11-04T02:08:37+00:00

Last Scan

Scanned 2024-10-28T02:08:37+00:00
URL https://curiousarchive.com/robots.txt
Redirect https://www.curiousarchive.com/robots.txt
Redirect Domain www.curiousarchive.com
Redirect Base curiousarchive.com
Domain IPs 138.68.188.101
Redirect IPs 138.68.188.101
Response IP 138.68.188.101
Found Yes
Hash e684df199d7ed66f00db5210ca9471eaf4a749cde47ba9f801e728fe6a0302d4
SimHash e900c800cfb3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

twitterbot

Rule Path
Disallow
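
As a rough illustration of how the groups above might be evaluated, the sketch below re-encodes them as a Python dictionary and applies longest-match precedence in the style of RFC 9309. The names `GROUPS` and `is_allowed` are hypothetical and not part of the scan data. Several simplifications are assumed: user agents are matched by exact lowercase token, only plain prefix matching is done (no `*` or `$` wildcards), and an empty Disallow path is treated as restricting nothing.

```python
# Groups as listed in the scan report (hypothetical in-memory encoding).
# An empty Disallow path is assumed to restrict nothing.
GROUPS = {
    "*": [("disallow", "/wp-admin/"), ("allow", "/wp-admin/admin-ajax.php")],
    "twitterbot": [("disallow", "")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` may be fetched by `user_agent`.

    Simplified longest-match evaluation: the matching rule with the
    longest path wins, and an Allow rule wins a tie. Paths with no
    matching rule are allowed.
    """
    # Fall back to the catch-all group when no specific group exists.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True
    for kind, rule_path in rules:
        # Skip empty rules and rules whose path is not a prefix of `path`.
        if not rule_path or not path.startswith(rule_path):
            continue
        longer = len(rule_path) > best_len
        tie_for_allow = len(rule_path) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len, allowed = len(rule_path), kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("examplebot", "/wp-admin/"),
        ("examplebot", "/wp-admin/admin-ajax.php"),
        ("twitterbot", "/wp-admin/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a generic crawler is kept out of /wp-admin/ except for admin-ajax.php, while twitterbot is unrestricted. Parsers that apply the first matching rule instead of the longest one may give a different answer for admin-ajax.php.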

Other Records

Field Value
sitemap https://www.curiousarchive.com/wp-sitemap.xml