arxivx.com
robots.txt
Robots Exclusion Standard data for arxivx.com
Resource Scan
Scan Details
Site Domain | arxivx.com |
Base Domain | arxivx.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2025-04-29T16:09:19+00:00 |
Next Scan | 2025-07-28T16:09:19+00:00 |
Last Successful Scan
Scanned | 2024-09-09T16:07:23+00:00 |
URL | https://arxivx.com/robots.txt |
Redirect | https://a3.arxivx.com/robots.txt |
Redirect Domain | a3.arxivx.com |
Redirect Base | arxivx.com |
Domain IPs | 104.21.64.253, 172.67.138.128, 2606:4700:3031::6815:40fd, 2606:4700:3037::ac43:8a80 |
Redirect IPs | 104.21.64.253, 172.67.138.128, 2606:4700:3031::6815:40fd, 2606:4700:3037::ac43:8a80 |
Response IP | 172.67.138.128 |
Found | Yes |
Hash | 34085f432ab2af1cc0226eab9de994ff28c7b4b20b21ae2b8a3b4b2fd3d10715 |
SimHash | 4c51d05487e1 |
Groups
*
Rule | Path |
---|---|
Disallow | /find-new/ |
Disallow | /account/ |
Disallow | /attachments/ |
Disallow | /goto/ |
Disallow | /posts/ |
Disallow | /login/ |
Disallow | /members/ |
Disallow | /register/connected-accounts/ |
Disallow | /admin.php |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://a3.arxivx.com/sitemap.php |