preservearticles.com
robots.txt
Robots Exclusion Standard data for preservearticles.com
Resource Scan
Scan Details
Site Domain | preservearticles.com |
Base Domain | preservearticles.com |
Scan Status | Ok |
Last Scan | 2025-10-03T01:31:57+00:00 |
Next Scan | 2025-10-10T01:31:57+00:00 |
Last Scan
Scanned | 2025-10-03T01:31:57+00:00 |
URL | https://preservearticles.com/robots.txt |
Domain IPs | 104.21.50.145, 172.67.207.2, 2606:4700:3033::6815:3291, 2606:4700:3034::ac43:cf02 |
Response IP | 104.21.50.145 |
Found | Yes |
Hash | 9f71b7e05f6ae638c924a6c5a25bb6e20eb54bedfd3c905d2bf9646bc7d8f397 |
SimHash | 65394153c6b1 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /wp-content/plugins/ |
Disallow | /feed/ |
Disallow | */feed/ |
Disallow | /index.php |
Disallow | /xmlrpc.php |
Disallow | /search? |
Disallow | /search/ |
Disallow | /page/ |
Disallow | /author/ |
Disallow | /home/ |
Disallow | /? |
Disallow | /?attachment_id |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 15 |