ingredientsonline.com
robots.txt
Robots Exclusion Standard data for ingredientsonline.com
Resource Scan
Scan Details
| Site Domain | ingredientsonline.com |
| Base Domain | ingredientsonline.com |
| Scan Status | Ok |
| Last Scan | 2026-01-18T06:53:57+00:00 |
| Next Scan | 2026-02-17T06:53:57+00:00 |
Last Scan
| Scanned | 2026-01-18T06:53:57+00:00 |
| URL | https://ingredientsonline.com/robots.txt |
| Redirect | https://www.ingredientsonline.com/robots.txt |
| Redirect Domain | www.ingredientsonline.com |
| Redirect Base | ingredientsonline.com |
| Domain IPs | 172.66.40.118, 172.66.43.138 |
| Redirect IPs | 172.66.40.118, 172.66.43.138 |
| Response IP | 172.66.43.138 |
| Found | Yes |
| Hash | 0416c5072f850eb4adb4c440f81d8427b7ad9291c14646f95702ef878d0b0b5d |
| SimHash | 4135b7985641 |
Groups
hubspot crawler
claude-searchbot
claude-user
perplexity-user
perplexitybot
google-extended
oai-searchbot
chatgpt-user
| Rule | Path |
|---|---|
| Allow | / |
googlebot
googlebot-image
googlebot-news
mediapartners-google
googlebot-mobile
adsbot-google
bingbot
slurp
semrushbot
| Rule | Path |
|---|---|
| Disallow | /index.php/ |
| Disallow | /*? |
| Disallow | /checkout/ |
| Disallow | /app/ |
| Disallow | /lib/ |
| Disallow | /*.php$ |
| Disallow | /pkginfo/ |
| Disallow | /report/ |
| Disallow | /var/ |
| Disallow | /catalog/ |
| Disallow | /customer/ |
| Disallow | /sendfriend/ |
| Disallow | /review/ |
| Disallow | /*SID%3D |
| Disallow | /customgraphql/* |
| Disallow | /customg |
| Disallow | /catalogsearch/result/ |
| Disallow | /*?q=* |
| Disallow | /login/ |
| Disallow | /*?segment= |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.ingredientsonline.com/sitemap.xml |