waste-management-world.com
robots.txt

Robots Exclusion Standard data for waste-management-world.com

Resource Scan

Scan Details

Site Domain waste-management-world.com
Base Domain waste-management-world.com
Scan Status Ok
Last Scan2026-03-05T03:11:21+00:00
Next Scan 2026-03-12T03:11:21+00:00

Last Scan

Scanned2026-03-05T03:11:21+00:00
URL https://waste-management-world.com/robots.txt
Domain IPs 104.21.89.54, 172.67.188.35, 2606:4700:3033::6815:5936, 2606:4700:3037::ac43:bc23
Response IP 172.67.188.35
Found Yes
Hash 11f0002eb3c4f7591d1a200e228428183f3d0a5bb27f5f8ed350e1eba343b617
SimHash e1501d5635b6

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /api$
Disallow /weka/user/loginstate$
Disallow /rss.xml$

Other Records

Field Value
sitemap https://waste-management-world.com/sitemaps-2-sitemap.xml
sitemap https://waste-management-world.com/news.xml

Comments

  • robots.txt for https://waste-management-world.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/