ingredientsonline.com
robots.txt

Robots Exclusion Standard data for ingredientsonline.com

Resource Scan

Scan Details

Site Domain ingredientsonline.com
Base Domain ingredientsonline.com
Scan Status Ok
Last Scan2026-01-18T06:53:57+00:00
Next Scan 2026-02-17T06:53:57+00:00

Last Scan

Scanned2026-01-18T06:53:57+00:00
URL https://ingredientsonline.com/robots.txt
Redirect https://www.ingredientsonline.com/robots.txt
Redirect Domain www.ingredientsonline.com
Redirect Base ingredientsonline.com
Domain IPs 172.66.40.118, 172.66.43.138
Redirect IPs 172.66.40.118, 172.66.43.138
Response IP 172.66.43.138
Found Yes
Hash 0416c5072f850eb4adb4c440f81d8427b7ad9291c14646f95702ef878d0b0b5d
SimHash 4135b7985641

Groups

hubspot crawler
claude-searchbot
claude-user
perplexity-user
perplexitybot
google-extended
oai-searchbot
chatgpt-user

Rule Path
Allow /

googlebot
googlebot-image
googlebot-news
mediapartners-google
googlebot-mobile
adsbot-google
bingbot
slurp
semrushbot

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /customgraphql/*
Disallow /customg
Disallow /catalogsearch/result/
Disallow /*?q=*
Disallow /login/
Disallow /*?segment=

Other Records

Field Value
sitemap https://www.ingredientsonline.com/sitemap.xml