threedecks.org
robots.txt
Robots Exclusion Standard data for threedecks.org
Resource Scan
Scan Details
Site Domain | threedecks.org |
Base Domain | threedecks.org |
Scan Status | Ok |
Last Scan | 2025-10-11T09:59:13+00:00 |
Next Scan | 2025-10-18T09:59:13+00:00 |
Last Scan
Scanned | 2025-10-11T09:59:13+00:00 |
URL | https://threedecks.org/robots.txt |
Domain IPs | 104.21.0.219, 172.67.128.83, 2606:4700:3035::6815:db, 2606:4700:3037::ac43:8053 |
Response IP | 104.21.0.219 |
Found | Yes |
Hash | 2408d5de1315d30d042dcffe34ce64b80b3a9545b53df6e00b533f1dd16a7410 |
SimHash | 44354b51cdd5 |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | /ajax/ |
Disallow | /cgi-bin/ |
Disallow | /css/ |
Disallow | /datafiles/ |
Disallow | /js/ |
Disallow | /logs/ |
Disallow | /php/ |
Disallow | /scripts/ |
Disallow | /utilities/ |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://threedecks.org/siteindex.xml |
Warnings
- `content-signal` is not a known field.
Comments