cleanomatics.com
robots.txt

Robots Exclusion Standard data for cleanomatics.com

Resource Scan

Scan Details

Site Domain cleanomatics.com
Base Domain cleanomatics.com
Scan Status Ok
Last Scan2025-07-25T22:17:40+00:00
Next Scan 2025-08-24T22:17:40+00:00

Last Scan

Scanned2025-07-25T22:17:40+00:00
URL https://www.cleanomatics.com/robots.txt
Domain IPs 103.103.196.139
Response IP 103.103.196.139
Found Yes
Hash 2cebd43eae3cfd7fd03d57de43b571931490277173a85d48896083fb903336d6
SimHash 243ccd55c7e3

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /private/
Disallow /cgi-bin/

googlebot

Rule Path
Disallow /test/
Disallow /tmp/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.cleanomatics.com/sitemap.xml

Comments

  • Allow all search engines access to everything
  • Specific rules for some user agents (if needed)
  • Sitemap file location
  • Crawl-delay directive to avoid overloading the server (optional, adjust as needed)