disci.org
robots.txt
Robots Exclusion Standard data for disci.org
Resource Scan
Scan Details
Site Domain | disci.org |
Base Domain | disci.org |
Scan Status | Ok |
Last Scan | 2025-10-11T16:16:10+00:00 |
Next Scan | 2025-10-18T16:16:10+00:00 |
Last Scan
Scanned | 2025-10-11T16:16:10+00:00 |
URL | https://disci.org/robots.txt |
Domain IPs | 104.21.38.234, 172.67.140.198, 2606:4700:3035::ac43:8cc6, 2606:4700:3037::6815:26ea |
Response IP | 172.67.140.198 |
Found | Yes |
Hash | 75bf825ced5ab6103ad64e8cb8cebfab96b6f8ac1ab559788abba869310105f0 |
SimHash | 44350b50cc14 |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | /cdn-cgi/ |
Disallow | /*add-to-cart%3D* |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://disci.org/sitemap_index.xml |
Warnings
- `content-signal` is not a known field.
Comments