irth.in
robots.txt

Robots Exclusion Standard data for irth.in

Resource Scan

Scan Details

Site Domain irth.in
Base Domain irth.in
Scan Status Ok
Last Scan2025-10-14T17:36:19+00:00
Next Scan 2025-11-13T17:36:19+00:00

Last Scan

Scanned2025-10-14T17:36:19+00:00
URL https://irth.in/robots.txt
Redirect https://www.irth.in/robots.txt
Redirect Domain www.irth.in
Redirect Base irth.in
Domain IPs 104.18.14.148, 104.18.15.148, 2606:4700::6812:e94, 2606:4700::6812:f94
Redirect IPs 104.18.14.148, 104.18.15.148, 2606:4700::6812:e94, 2606:4700::6812:f94
Response IP 104.18.14.148
Found Yes
Hash a9bcb0cf69ac737911b2a4fd557b318569d16297f469551f1f14f907209cc3ab
SimHash ca04ca56e553

Groups

*

Rule Path
Disallow /cart*
Disallow /checkout*
Disallow /myaccount*
Disallow /search?*
Disallow /order/track*
Disallow /wps/contenthandler/*
Disallow /Sites-Irth-Site/*
Disallow *Search-UpdateGrid*
Disallow *?lang=en_IN*
Disallow *_p.html

googlebot-image

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.irth.in/sitemap_index.xml