nagariknews.com
robots.txt

Robots Exclusion Standard data for nagariknews.com

Resource Scan

Scan Details

Site Domain nagariknews.com
Base Domain nagariknews.com
Scan Status Ok
Last Scan2024-06-01T19:15:09+00:00
Next Scan 2024-07-01T19:15:09+00:00

Last Scan

Scanned2024-06-01T19:15:09+00:00
URL https://nagariknews.com/robots.txt
Redirect https://nagariknews.nagariknetwork.com/robots.txt
Redirect Domain nagariknews.nagariknetwork.com
Redirect Base nagariknetwork.com
Domain IPs 3.0.205.60, 3.1.226.53, 52.220.222.161
Redirect IPs 13.225.4.14, 13.225.4.46, 13.225.4.69, 13.225.4.76
Response IP 13.225.4.14
Found Yes
Hash 779eff072243880064ce1942721c0d55095eaf6b4fc826ffe7c58cc4cb601be2
SimHash a8188910c9e1

Groups

*

Rule Path
Disallow

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /amp

googlebot-image

Rule Path
Allow /

Comments

  • Disallow everything.
  • Certain social media sites are whitelisted to allow crawlers to access page markup when links to /images are shared.