dewalist.com
robots.txt
Robots Exclusion Standard data for dewalist.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | dewalist.com |
Base Domain | dewalist.com |
Scan Status | Ok |
Last Scan | 4/5/2025, 1:10:33 AM |
Next Scan | 4/12/2025, 1:10:33 AM |
Last Scan
Field | Value |
---|---|
Scanned | 4/5/2025, 1:10:33 AM |
URL | https://dewalist.com/robots.txt |
Domain IPs | 104.21.52.225, 172.67.204.237, 2606:4700:3030::6815:34e1, 2606:4700:3035::ac43:cced |
Response IP | 172.67.204.237 |
Found | Yes |
Hash | 12180b449f3e6b253f08ab9c04e5b4c03aeda9f321bc182622cb178acdbacf2f |
SimHash | 2b2b185e0567 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /plugins/ |
Disallow | /libs/ |
Disallow | /includes/ |
Disallow | /print* |
Disallow | /*?sort_by= |
Disallow | /*%26sort_by%3D |
Disallow | /*?sort_type= |
Disallow | /*%26sort_type%3D |
Disallow | /*confirm.html* |
Disallow | /*listing-details.html* |
Disallow | /*print.html* |
Disallow | /404.html* |
Disallow | /*listing-remove.html* |
Disallow | /*newsletter.html* |
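The rules above mix plain path prefixes with `*` wildcards, so it can help to see how a crawler might evaluate them. The following is a minimal sketch, assuming the matching behaviour described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers, not part of the scan output, and the sketch compares raw strings without normalizing percent-encoding.

```python
import re

# Rules copied from the group table above, as (kind, pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/plugins/"),
    ("disallow", "/libs/"),
    ("disallow", "/includes/"),
    ("disallow", "/print*"),
    ("disallow", "/*?sort_by="),
    ("disallow", "/*%26sort_by%3D"),
    ("disallow", "/*?sort_type="),
    ("disallow", "/*%26sort_type%3D"),
    ("disallow", "/*confirm.html*"),
    ("disallow", "/*listing-details.html*"),
    ("disallow", "/*print.html*"),
    ("disallow", "/404.html*"),
    ("disallow", "/*listing-remove.html*"),
    ("disallow", "/*newsletter.html*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


for sample in ["/", "/plugins/app.js", "/cars/?sort_by=price", "/printers/"]:
    print(sample, is_allowed(sample))
```

Under these assumptions, `/print*` is a prefix rule, so it also blocks unrelated paths such as `/printers/`. Because every Disallow pattern is longer than the single-character `Allow: /`, any path matching a Disallow rule is blocked.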
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
sitemap | https://www.dewalist.com/sitemap.xml |
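As an illustration of how a client might read these two records, here is a small sketch using Python's standard `urllib.robotparser`. The `ROBOTS_LINES` list is an assumed reconstruction from the tables in this report; the real file's line order and layout are not shown in the scan. Placing `Crawl-delay` inside the `*` group is also an assumption, made because this parser only honours it there. `Crawl-delay` is a non-standard extension that not every crawler respects, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the relevant lines; the actual file layout
# is not part of the scan output.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /",
    "Crawl-delay: 20",
    "Sitemap: https://www.dewalist.com/sitemap.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds a polite crawler would wait between requests for this group.
delay = parser.crawl_delay("*")

# List of sitemap URLs declared in the file, or None if there are none.
sitemaps = parser.site_maps()

print(delay)
print(sitemaps)
```

Note that `urllib.robotparser` does not implement `*` wildcards in paths, so it is suitable for reading these records but not for evaluating the wildcard rules in the group above.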