ghanaweb.live
robots.txt

Robots Exclusion Standard data for ghanaweb.live

Resource Scan

Scan Details

Site Domain ghanaweb.live
Base Domain ghanaweb.live
Scan Status Ok
Last Scan2024-11-14T00:36:09+00:00
Next Scan 2024-11-21T00:36:09+00:00

Last Scan

Scanned2024-11-14T00:36:09+00:00
URL https://ghanaweb.live/robots.txt
Domain IPs 104.21.95.42, 172.67.142.251, 2606:4700:3034::ac43:8efb, 2606:4700:3037::6815:5f2a
Response IP 104.21.95.42
Found Yes
Hash bb4961e30abf59b8f263ae92aa2d9b190f4e003f46e9f5e1d4403b4c573aea7b
SimHash cb350510cc03

Groups

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /headlines/
Disallow /GhanaHomePage/classifieds/archive.php
Disallow /sil/
Disallow /cdn-cgi/
Disallow *feedback.php
Disallow /validate_user.php?url=*

Other Records

Field Value
sitemap https://www.ghanaweb.live/sitemaps/sitemap.xml
sitemap https://www.ghanaweb.live/sitemaps/articles.xml
sitemap https://www.ghanaweb.live/sitemaps/news.xml
sitemap https://www.ghanaweb.live/sitemaps/videos.xml

Comments

  • Disallow: /GhanaHomePage/classifieds/show.photo.php
  • Disallow: /GhanaHomePage/world/
  • Disallow: *comment=
  • Disallow: *?audio=1
  • Disallow: *&nav=