im.rediff.com
robots.txt

Robots Exclusion Standard data for im.rediff.com

Resource Scan

Scan Details

Site Domain im.rediff.com
Base Domain rediff.com
Scan Status Ok
Last Scan2024-09-25T21:30:06+00:00
Next Scan 2024-10-02T21:30:06+00:00

Last Scan

Scanned2024-09-25T21:30:06+00:00
URL https://im.rediff.com/robots.txt
Domain IPs 23.215.7.10, 23.215.7.9, 2600:1413:b000:1b::17d7:709, 2600:1413:b000:1b::17d7:70a
Response IP 23.32.29.17
Found Yes
Hash 3840b1c645eda4403ac56bdcc7789b0fbe23475974e737554fa3c630afefcd1b
SimHash 2401bf483793

Groups

*

Rule Path
Disallow /uim/
Disallow /images/
Disallow /messageboard/
Disallow /newsletters/
Disallow /cms/print.jsp

Other Records

Field Value
sitemap https://www.rediff.com/sitemap.xml
sitemap https://www.rediff.com/gnewssitemap.xml

Comments

  • http://www.rediff.com: robots.txt