thebridge.in
robots.txt

Robots Exclusion Standard data for thebridge.in

Resource Scan

Scan Details

Site Domain thebridge.in
Base Domain thebridge.in
Scan Status Ok
Last Scan2024-06-20T03:16:46+00:00
Next Scan 2024-06-27T03:16:46+00:00

Last Scan

Scanned2024-06-20T03:16:46+00:00
URL https://thebridge.in/robots.txt
Domain IPs 104.21.54.37, 172.67.223.31, 2606:4700:3033::6815:3625, 2606:4700:3033::ac43:df1f
Response IP 172.67.223.31
Found Yes
Hash ddb5e99c200e96dd225cccb5f49d63e56e3443110bf7187ab22b674d1f7d385f
SimHash a0421a52cdd3

Groups

*

Rule Path
Allow /
Disallow /admin/*
Disallow /partnercontent/*
Disallow /xhr/*
Disallow /app-lite/*
Disallow /refer-*
Disallow /search/*
Disallow /search?*
Disallow /xhr/*
Disallow /preview/story-*
Disallow /amp/preview/story-*
Disallow /staging/*
Disallow /xhr/getNewsMixin*

Other Records

Field Value
sitemap https://thebridge.in/sitemap/sitemap-index.xml
sitemap https://thebridge.in/news-sitemap-daily.xml
sitemap https://thebridge.in/sitemap-daily.xml

Comments

  • robots.txt for