cmtithalat.com
robots.txt

Robots Exclusion Standard data for cmtithalat.com

Resource Scan

Scan Details

Site Domain cmtithalat.com
Base Domain cmtithalat.com
Scan Status Ok
Last Scan2025-10-14T08:00:29+00:00
Next Scan 2025-11-13T08:00:29+00:00

Last Scan

Scanned2025-10-14T08:00:29+00:00
URL https://cmtithalat.com/robots.txt
Domain IPs 104.21.26.120, 172.67.168.71, 2606:4700:3033::ac43:a847, 2606:4700:3036::6815:1a78
Response IP 172.67.168.71
Found Yes
Hash 1dd6d86421da31fe6b5a768a993d6e0a7df5ec0c769f74bf47e7082b7835f773
SimHash 041dca646d91

Groups

*

Rule Path
Disallow /*route%3Daccount/
Disallow /*route%3Daffiliate/
Disallow /*route%3Dcheckout/
Disallow /*route%3Dproduct/search
Disallow /index.php?route=product%2Fproduct*&manufacturer_id=
Disallow /admin
Disallow /catalog
Disallow /system
Disallow /*?sort=
Disallow /*%26sort%3D
Disallow /*?order=
Disallow /*%26order%3D
Disallow /*?limit=
Disallow /*%26limit%3D
Disallow /*?format=
Disallow /*%26format%3D
Disallow /*?tracking=
Disallow /*%26tracking%3D
Disallow /*?filter=
Disallow /*%26filter%3D
Disallow /*?filter_name=
Disallow /*%26filter_name%3D
Disallow /*?filter_sub_category=
Disallow /*%26filter_sub_category%3D
Disallow /*?filter_description=
Disallow /*%26filter_description%3D

Other Records

Field Value
sitemap https://cmtithalat.com/index.php?route=extension/feed/google_sitemap

Warnings

  • `host` is not a known field.