trainman.in
robots.txt

Robots Exclusion Standard data for trainman.in

Resource Scan

Scan Details

Site Domain trainman.in
Base Domain trainman.in
Scan Status Ok
Last Scan2024-11-10T20:38:41+00:00
Next Scan 2024-11-17T20:38:41+00:00

Last Scan

Scanned2024-11-10T20:38:41+00:00
URL https://trainman.in/robots.txt
Redirect https://www.trainman.in/robots.txt
Redirect Domain www.trainman.in
Redirect Base trainman.in
Domain IPs 20.192.98.161
Redirect IPs 23.215.7.25, 23.215.7.26, 2600:1413:1::6011:b430, 2600:1413:1::6011:b432
Response IP 96.17.180.48
Found Yes
Hash ceffcb0571a8ac1536e1c15f00c229abbb78bee9943e27f3f4666272a6c5994f
SimHash 7024ac6acbe3

Groups

*

Rule Path
Disallow /pnr/*
Disallow /services/*
Disallow /*?*

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.trainman.in/sitemap.xml
sitemap https://www.trainman.in/sitemap/sitemap-book-rail-ticket.xml
sitemap https://www.trainman.in/sitemap/sitemap-train-coach-position.xml
sitemap https://www.trainman.in/sitemap/sitemap-train-fare.xml
sitemap https://www.trainman.in/sitemap/sitemap-train-seat-availability.xml
sitemap https://www.trainman.in/sitemap/sitemap-train-running-status.xml

Comments

  • 22-Mar-24 URL parameters blocking for SEO
  • 22-Mar-24 adding SEO city to city page sitemap
  • 22-Mar-24 to avoid GPT crowler
  • www.trainman.in