headlinestoday.in
robots.txt

Robots Exclusion Standard data for headlinestoday.in

Resource Scan

Scan Details

Site Domain headlinestoday.in
Base Domain headlinestoday.in
Scan Status Ok
Last Scan2024-05-26T15:55:31+00:00
Next Scan 2024-06-02T15:55:31+00:00

Last Scan

Scanned2024-05-26T15:55:31+00:00
URL https://www.headlinestoday.in/robots.txt
Domain IPs 23.44.4.211, 23.44.4.242, 2600:1413:1::1734:ab60, 2600:1413:1::1734:ab72
Response IP 42.99.140.137
Found Yes
Hash 744247e0149b64dc6d89b21fb73fb0362184315d25b9f085e2969e4b750ea274
SimHash 38e1f10065f3

Groups

*

Rule Path
Disallow /technology/*
Disallow /top-news/*
Disallow /news/*
Disallow /entertainment/*
Disallow /health/*
Disallow /sports/*
Disallow /trending-video-it/*
Disallow /vegitable/*
Disallow /fuel/*
Disallow /live-stream/*
Disallow /times-now/*
Disallow /toi/*
Disallow /hindustantimes/*
Disallow /trending/*
Disallow /search/*