cetaphil.in
robots.txt

Robots Exclusion Standard data for cetaphil.in

Resource Scan

Scan Details

Site Domain cetaphil.in
Base Domain cetaphil.in
Scan Status Ok
Last Scan2025-11-19T19:18:41+00:00
Next Scan 2025-12-19T19:18:41+00:00

Last Scan

Scanned2025-11-19T19:18:41+00:00
URL https://cetaphil.in/robots.txt
Redirect https://www.cetaphil.in/robots.txt
Redirect Domain www.cetaphil.in
Redirect Base cetaphil.in
Domain IPs 104.18.42.71, 172.64.145.185
Redirect IPs 104.18.42.71, 172.64.145.185
Response IP 104.18.42.71
Found Yes
Hash 50128ed5cf8f2c6d079c1d38a7be6eea68b48bd3a6d1f02a5514af7235343ac1
SimHash 080844b2a510

Groups

*

Rule Path
Disallow /*?
Allow /*.woff
Allow /*.woff2
Allow /*.ttf
Allow /*.eot
Allow /*.otf
Allow /*.svg
Allow /*.css
Allow /*.js
Allow /*.png
Allow /*.jpg
Allow /*.jpeg
Allow /*.webp
Allow /*.gif
Disallow /on/demandware.store/Sites-RefArch-Site/en_IN/Home-Show

Other Records

Field Value
sitemap https://www.cetaphil.in/sitemap_index.xml

Comments

  • Allow static resources like fonts, images, and CSS/JS even if they have parameters
  • Block specific irrelevant system URLs