thedjlist.com
robots.txt
Robots Exclusion Standard data for thedjlist.com
Resource Scan
Scan Details
Site Domain | thedjlist.com |
Base Domain | thedjlist.com |
Scan Status | Ok |
Last Scan | 2024-09-23T01:41:31+00:00 |
Next Scan | 2024-09-30T01:41:31+00:00 |
Last Scan
Scanned | 2024-09-23T01:41:31+00:00 |
URL | https://thedjlist.com/robots.txt |
Domain IPs | 104.22.12.231, 104.22.13.231, 172.67.26.129, 2606:4700:10::6816:ce7, 2606:4700:10::6816:de7, 2606:4700:10::ac43:1a81 |
Response IP | 104.22.12.231 |
Found | Yes |
Hash | 0bd12dc7a01e9ac537236318145d49cc18823ba4b120587b9cbe9bea0e38c48d |
SimHash | de64da028073 |
Groups
*
Rule | Path |
---|---|
Disallow | /*%3D$ |
Disallow | /refer/? |
Disallow | /members/* |
Disallow | /search/* |
Disallow | /gsearch/* |
Disallow | /corporate/* |
Disallow | /mix/*/download/ |
Disallow | /fb_login/ |
Disallow | /ajax_data/ |
Disallow | /unsubscribed/ |
Disallow | /mail_processing/ |
Disallow | /advertise/mixes/new/? |
Other Records
Field | Value |
---|---|
sitemap | http://thedjlist.com/sitemap.xml |
Warnings
- 2 invalid lines.