wastecorp.com
robots.txt

Robots Exclusion Standard data for wastecorp.com

Resource Scan

Scan Details

Site Domain wastecorp.com
Base Domain wastecorp.com
Scan Status Ok
Last Scan 2024-10-19T09:29:11+00:00
Next Scan 2024-11-18T09:29:11+00:00

Last Scan

Scanned 2024-10-19T09:29:11+00:00
URL https://wastecorp.com/robots.txt
Domain IPs 192.124.249.102
Response IP 192.124.249.102
Found Yes
Hash cee2f0f99611325b5a9ce765f777317ba401bd37f6142673b8fb8ac096060afd
SimHash cc0988719d94

Groups

*

Rule Path
Disallow /*%26limit
Disallow /*?sort
Disallow /*%26sort
Disallow /*?route=checkout%2F
Disallow /*?route=account%2F
Disallow /*?route=product%2Fsearch
Disallow /*?route=product%2Fcompare
Disallow /*?route=product%2Fspecial
Disallow /*?route=product%2Fcustomize
Disallow /*?route=common%2Flocale
Disallow /*?route=case_study%2Fcase_study
Disallow /*?route=product%2Fcategory
Disallow /*?route=information%2Fsitemap
Disallow /*?route=information%2Fcontact
Disallow /*?_route_=
Disallow /*?p=
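
The rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how a crawler might test a URL path against these Disallow patterns. It assumes `*` matches any run of characters, a trailing `$` anchors the end of the URL, and everything else is compared literally. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical, not part of the scan data.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/*%26limit",
    "/*?sort",
    "/*%26sort",
    "/*?route=checkout%2F",
    "/*?route=account%2F",
    "/*?route=product%2Fsearch",
    "/*?route=product%2Fcompare",
    "/*?route=product%2Fspecial",
    "/*?route=product%2Fcustomize",
    "/*?route=common%2Flocale",
    "/*?route=case_study%2Fcase_study",
    "/*?route=product%2Fcategory",
    "/*?route=information%2Fsitemap",
    "/*?route=information%2Fcontact",
    "/*?_route_=",
    "/*?p=",
]


def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing '$'
    anchors the end of the URL; all other characters are literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path_and_query, patterns=DISALLOW):
    """Return True if the path (including query string) matches any pattern."""
    return any(pattern_to_regex(p).match(path_and_query) for p in patterns)


if __name__ == "__main__":
    # Hypothetical sample paths, for illustration only.
    for sample in ("/index.php?route=checkout%2Fcart", "/pumps?sort=price", "/pumps"):
        print(sample, is_disallowed(sample))
```

This sketch compares strings literally and does not normalize percent-encoding, so a rule written with `%2F` or `%26` only matches URLs that carry the same encoded form. A production crawler would also need to handle Allow rules and longest-match precedence, neither of which appears in this group.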

Other Records

Field Value
sitemap https://wastecorp.com/sitemap.xml