dewalist.com
robots.txt

Robots Exclusion Standard data for dewalist.com

Resource Scan

Scan Details

Site Domain dewalist.com
Base Domain dewalist.com
Scan Status Ok
Last Scan4/5/2025, 1:10:33 AM
Next Scan 4/12/2025, 1:10:33 AM

Last Scan

Scanned4/5/2025, 1:10:33 AM
URL https://dewalist.com/robots.txt
Domain IPs 104.21.52.225, 172.67.204.237, 2606:4700:3030::6815:34e1, 2606:4700:3035::ac43:cced
Response IP 172.67.204.237
Found Yes
Hash 12180b449f3e6b253f08ab9c04e5b4c03aeda9f321bc182622cb178acdbacf2f
SimHash 2b2b185e0567

Groups

*

Rule Path
Allow /
Disallow /plugins/
Disallow /libs/
Disallow /includes/
Disallow /print*
Disallow /*?sort_by=
Disallow /*%26sort_by%3D
Disallow /*?sort_type=
Disallow /*%26sort_type%3D
Disallow /*confirm.html*
Disallow /*listing-details.html*
Disallow /*print.html*
Disallow /404.html*
Disallow /*listing-remove.html*
Disallow /*newsletter.html*

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.dewalist.com/sitemap.xml

Comments

  • robots.txt
  • Rules generated by the Sitemap plugin
  • Excluded pages: