indianexpresss.in
robots.txt

Robots Exclusion Standard data for indianexpresss.in

Resource Scan

Scan Details

Site Domain indianexpresss.in
Base Domain indianexpresss.in
Scan Status Ok
Last Scan2025-02-15T06:44:35+00:00
Next Scan 2025-03-17T06:44:35+00:00

Last Scan

Scanned2025-02-15T06:44:35+00:00
URL https://indianexpresss.in/robots.txt
Domain IPs 104.21.76.141, 172.67.196.26, 2606:4700:3033::ac43:c41a, 2606:4700:3035::6815:4c8d
Response IP 172.67.196.26
Found Yes
Hash a66c58b949300c4b6e0de6f8e06a3f91cbc7e19e5136f79f7bf111e1db4ede88
SimHash 7938f970cf93

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login/*
Disallow /pages__trashed/
Disallow /?
Disallow *?s=
Disallow /search
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /reg/
Disallow /click

Other Records

Field Value
sitemap https://indianexpresss.in/sitemap.xml