media-central.indianexpress.com
robots.txt

Robots Exclusion Standard data for media-central.indianexpress.com

Resource Scan

Scan Details

Site Domain media-central.indianexpress.com
Base Domain indianexpress.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-02-11T00:31:22+00:00
Next Scan 2024-05-11T00:31:22+00:00

Last Successful Scan

Scanned 2023-03-20T22:03:03+00:00
URL https://media-central.indianexpress.com/robots.txt
Domain IPs 23.39.4.128
Response IP 23.39.4.128
Found Yes
Hash 923929c26f3e1ec648d42471715af990b6a782b9cc3735df6d30d94d54862e6a
SimHash 98145010c7f3

Groups

*

Rule Path
Disallow /

twitterbot

Rule Path
Allow /media/gni/

facebookexternalhit

Rule Path
Allow /media/gni/
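
The groups above can be checked with a short script. This is a minimal sketch, not the site's actual file: the `ROBOTS_TXT` text is reconstructed from the listed groups (its exact wording, ordering, and casing are assumptions), the `allowed` helper is a hypothetical name introduced here, and the sample paths are made up for illustration. It uses Python's standard `urllib.robotparser`.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report. The original file's
# exact text, ordering, and casing are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /

User-agent: twitterbot
Allow: /media/gni/

User-agent: facebookexternalhit
Allow: /media/gni/
"""

BASE = "https://media-central.indianexpress.com"

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` under these rules?"""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # Sample paths are illustrative only.
    for agent in ("Googlebot", "twitterbot", "facebookexternalhit/1.1"):
        print(agent, allowed(agent, "/media/gni/example.jpg"))
```

Under these assumptions, a crawler with no named group falls back to the `*` group and is refused everything, while `twitterbot` and `facebookexternalhit` are permitted under `/media/gni/`.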