al-watan.com
robots.txt

Robots Exclusion Standard data for al-watan.com

Resource Scan

Scan Details

Site Domain al-watan.com
Base Domain al-watan.com
Scan Status Ok
Last Scan2024-05-24T20:08:02+00:00
Next Scan 2024-05-31T20:08:02+00:00

Last Scan

Scanned2024-05-24T20:08:02+00:00
URL https://al-watan.com/robots.txt
Redirect https://www.al-watan.com/robots.txt
Redirect Domain www.al-watan.com
Redirect Base al-watan.com
Domain IPs 104.26.4.251, 104.26.5.251, 172.67.72.38, 2606:4700:20::681a:4fb, 2606:4700:20::681a:5fb, 2606:4700:20::ac43:4826
Redirect IPs 104.26.4.251, 104.26.5.251, 172.67.72.38, 2606:4700:20::681a:4fb, 2606:4700:20::681a:5fb, 2606:4700:20::ac43:4826
Response IP 104.26.5.251
Found Yes
Hash 95197ad079c70425091988da9a979228084df75721c369936f2df264dc5fa8fa
SimHash 91ed70474ffc

Groups

*

Rule Path
Disallow /ajax/*
Disallow /print*
Disallow /getRelatedArticles*
Disallow /getMostReadArticles*
Disallow /article_count/*
Disallow /get-menu-header*
Disallow /search*
Disallow /morearticles/*
Disallow /article.php*
Disallow /login-mgt
Disallow /*.php
Disallow /*.pdf
Disallow /archive/*
Disallow /rss
Disallow /rssFeed/*
Disallow /widget/*
Disallow */page/*