irth.in
robots.txt
Robots Exclusion Standard data for irth.in
Resource Scan
Scan Details
| Site Domain | irth.in |
| Base Domain | irth.in |
| Scan Status | Ok |
| Last Scan | 2025-10-14T17:36:19+00:00 |
| Next Scan | 2025-11-13T17:36:19+00:00 |
Last Scan
| Scanned | 2025-10-14T17:36:19+00:00 |
| URL | https://irth.in/robots.txt |
| Redirect | https://www.irth.in/robots.txt |
| Redirect Domain | www.irth.in |
| Redirect Base | irth.in |
| Domain IPs | 104.18.14.148, 104.18.15.148, 2606:4700::6812:e94, 2606:4700::6812:f94 |
| Redirect IPs | 104.18.14.148, 104.18.15.148, 2606:4700::6812:e94, 2606:4700::6812:f94 |
| Response IP | 104.18.14.148 |
| Found | Yes |
| Hash | a9bcb0cf69ac737911b2a4fd557b318569d16297f469551f1f14f907209cc3ab |
| SimHash | ca04ca56e553 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /cart* |
| Disallow | /checkout* |
| Disallow | /myaccount* |
| Disallow | /search?* |
| Disallow | /order/track* |
| Disallow | /wps/contenthandler/* |
| Disallow | /Sites-Irth-Site/* |
| Disallow | *Search-UpdateGrid* |
| Disallow | *?lang=en_IN* |
| Disallow | *_p.html |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.irth.in/sitemap_index.xml |