daralakhbar.com
robots.txt

Robots Exclusion Standard data for daralakhbar.com

Resource Scan

Scan Details

Site Domain daralakhbar.com
Base Domain daralakhbar.com
Scan Status Ok
Last Scan 2024-06-21T03:41:53+00:00
Next Scan 2024-07-05T03:41:53+00:00

Last Scan

Scanned 2024-06-21T03:41:53+00:00
URL https://daralakhbar.com/robots.txt
Redirect http://www.daralakhbar.com/robots.txt
Redirect Domain www.daralakhbar.com
Redirect Base daralakhbar.com
Domain IPs 104.21.65.133, 172.67.163.123, 2606:4700:3033::6815:4185, 2606:4700:3033::ac43:a37b
Redirect IPs 104.21.65.133, 172.67.163.123, 2606:4700:3033::6815:4185, 2606:4700:3033::ac43:a37b
Response IP 104.21.65.133
Found Yes
Hash 9e6b8158796f0f406dfe3535ee8533dc5d3263b7616182d4f058dff210c3cf0f
SimHash 6b8c2cc56450

Groups

*

Rule Path
Disallow /admin/

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap http://www.daralakhbar.com/sitemaps/tags/sitemap_index.xml.gz
sitemap http://www.daralakhbar.com/sitemaps/sections/sitemap_index.xml.gz
sitemap http://www.daralakhbar.com/sitemaps/sources/sitemap_index.xml.gz
sitemap http://www.daralakhbar.com/sitemaps/videos/sitemap_index.xml.gz
sitemap http://www.daralakhbar.com/sitemaps/articles/sitemap_index.xml.gz
sitemap http://www.daralakhbar.com/sitemaps/static/sitemap_index.xml.gz
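The records above can be checked with a short sketch using Python's standard-library urllib.robotparser. The robots.txt text below is an assumption: it is rebuilt from the parsed records in this scan (one group for *, one Disallow rule, a crawl-delay of 5, and the six sitemap entries), not copied from the file served by the site. The helper name is_allowed and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the scan records above;
# the real file may order or word these lines differently.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Crawl-delay: 5

Sitemap: http://www.daralakhbar.com/sitemaps/tags/sitemap_index.xml.gz
Sitemap: http://www.daralakhbar.com/sitemaps/sections/sitemap_index.xml.gz
Sitemap: http://www.daralakhbar.com/sitemaps/sources/sitemap_index.xml.gz
Sitemap: http://www.daralakhbar.com/sitemaps/videos/sitemap_index.xml.gz
Sitemap: http://www.daralakhbar.com/sitemaps/articles/sitemap_index.xml.gz
Sitemap: http://www.daralakhbar.com/sitemaps/static/sitemap_index.xml.gz
"""

BASE_URL = "http://www.daralakhbar.com"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="*"):
    """Return True if `agent` may fetch `path` under the rules above (hypothetical helper)."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the single Disallow rule.
    for path in ("/admin/login", "/sections/"):
        print(path, is_allowed(path))
    print("crawl-delay:", parser.crawl_delay("*"))
    print("sitemaps:", len(parser.site_maps() or []))
```

A crawler honoring these records would skip anything under /admin/ and wait the crawl-delay value between requests. Note that crawl-delay is a non-standard field that some crawlers ignore, and that site_maps() requires Python 3.8 or later.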

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: Google
  • Disallow: /admin/
  • User-agent: Googlebot
  • Disallow: /admin/
  • User-agent: Mediapartners-Google
  • Disallow: /admin/
  • User-Agent: *
  • Disallow: /