ghanaweb.live
robots.txt

Robots Exclusion Standard data for ghanaweb.live

Resource Scan

Scan Details

Site Domain ghanaweb.live
Base Domain ghanaweb.live
Scan Status Ok
Last Scan 2024-07-03T11:11:49+00:00
Next Scan 2024-07-10T11:11:49+00:00

Last Scan

Scanned 2024-07-03T11:11:49+00:00
URL https://ghanaweb.live/robots.txt
Domain IPs 104.21.95.42, 172.67.142.251, 2606:4700:3034::ac43:8efb, 2606:4700:3037::6815:5f2a
Response IP 104.21.95.42
Found Yes
Hash d9c67bbe445f4d240a38bccf99afcb43e658160b73a82b84d9e99231e716828c
SimHash cb3555504c13
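
The 64-hex-digit Hash value above is consistent with a SHA-256 digest, although the scan does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, a digest of the fetched robots.txt body can be compared between scans to detect changes. The function names `body_digest` and `has_changed` are hypothetical and not part of the scanner.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Report whether the body differs from the one seen at the previous scan."""
    return body_digest(body) != previous_digest


if __name__ == "__main__":
    # Illustrative body only; this is not the real ghanaweb.live file.
    sample = b"User-agent: petalbot\nDisallow: /\n"
    stored = body_digest(sample)
    print(has_changed(stored, sample))          # same bytes, no change
    print(has_changed(stored, sample + b"\n"))  # one extra byte, changed
```

An exact digest only signals that something changed; a similarity hash such as the SimHash value listed above is the kind of measure typically used to judge how much changed.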

Groups

mediapartners-google

Rule Path
Disallow (empty)

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /headlines/
Disallow /GhanaHomePage/classifieds/archive.php
Disallow /sil/
Disallow /cdn-cgi/
Disallow *feedback.php
Disallow /validate_user.php?url=*
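
To illustrate how the three groups above apply to a request, here is a minimal sketch of a matcher. It assumes RFC 9309-style semantics: `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and an empty Disallow path restricts nothing. The names `GROUPS`, `pattern_to_regex` and `is_allowed` are hypothetical, and user-agent selection is simplified to an exact, case-insensitive name lookup with `*` as the fallback.

```python
import re

# Groups as listed in the scan; an empty Disallow path places no restriction.
GROUPS = {
    "mediapartners-google": [("disallow", "")],
    "petalbot": [("disallow", "/")],
    "*": [
        ("disallow", "/headlines/"),
        ("disallow", "/GhanaHomePage/classifieds/archive.php"),
        ("disallow", "/sil/"),
        ("disallow", "/cdn-cgi/"),
        ("disallow", "*feedback.php"),
        ("disallow", "/validate_user.php?url=*"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and turn each '*' into '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if the path (including any query string) may be fetched."""
    # Simplification: exact name lookup, falling back to the '*' group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True
    for kind, pattern in rules:
        if not pattern:
            continue  # empty path: the rule matches nothing
        if pattern_to_regex(pattern).match(path):
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and kind == "allow"
            if longer or tie_for_allow:
                best_len, allowed = len(pattern), kind == "allow"
    return allowed


if __name__ == "__main__":
    print(is_allowed("PetalBot", "/GhanaHomePage/"))               # blocked everywhere
    print(is_allowed("Mediapartners-Google", "/headlines/today"))  # unrestricted
    print(is_allowed("ExampleBot", "/headlines/today"))            # falls under '*'
```

The standard library's `urllib.robotparser` treats paths as plain prefixes, which is why this sketch does its own wildcard translation for entries such as `*feedback.php`.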

Other Records

Field Value
sitemap https://www.ghanaweb.live/sitemaps/sitemap.xml
sitemap https://www.ghanaweb.live/sitemaps/articles.xml
sitemap https://www.ghanaweb.live/sitemaps/news.xml
sitemap https://www.ghanaweb.live/sitemaps/videos.xml

Comments

  • Disallow: /GhanaHomePage/classifieds/show.photo.php
  • Disallow: /GhanaHomePage/world/
  • Disallow: *comment=
  • Disallow: *?audio=1
  • Disallow: *&nav=