imgci.com
robots.txt

Robots Exclusion Standard data for imgci.com

Resource Scan

Scan Details

Site Domain imgci.com
Base Domain imgci.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-10-05T16:22:12+00:00
Next Scan 2025-01-03T16:22:12+00:00

Last Successful Scan

Scanned2021-12-23T11:37:04+00:00
URL http://imgci.com/robots.txt
Redirect https://www.espncricinfo.com/robots.txt
Redirect Domain www.espncricinfo.com
Redirect Base espncricinfo.com
Response IP 23.64.122.33
Found Yes
Hash 01411e078420be0ac53af5651ee005d25940e5e3ba900c020b50abec728a2643
SimHash e9473a0ce38d

Groups

*

Rule Path
Disallow /*wrappertype%3Dprint
Disallow /*/content/url/
Disallow /*/content/current/url/
Disallow /error
Disallow /fragments/
Disallow /logos/
Disallow /country-fragment/
Disallow /country-fragment2/
Disallow /cgi-bin/
Disallow /classes/
Disallow /format/
Disallow /frames/
Disallow /db/HELPFILES/
Disallow /db/MANAGEMENT/
Disallow /db/MISC/CRICINFO_DATA/
Disallow /db/SUPPORT/ADVERTS/
Disallow /db/SUPPORT/AFP/
Disallow /db/SUPPORT/BSTAR/
Disallow /db/SUPPORT/DAWN/
Disallow /db/SUPPORT/DAWSON/
Disallow /db/SUPPORT/ET/
Disallow /db/SUPPORT/JAGGED/
Disallow /db/SUPPORT/SHOP/
Disallow /*?cmp
Disallow /*?addata
Disallow /*?wrappertype=print