aljazeera.net
robots.txt

Robots Exclusion Standard data for aljazeera.net

Resource Scan

Scan Details

Site Domain aljazeera.net
Base Domain aljazeera.net
Scan Status Ok
Last Scan2024-05-03T03:07:11+00:00
Next Scan 2024-05-10T03:07:11+00:00

Last Scan

Scanned2024-05-03T03:07:11+00:00
URL https://aljazeera.net/robots.txt
Redirect https://www.aljazeera.net:443/robots.txt
Redirect Domain www.aljazeera.net
Redirect Base aljazeera.net
Domain IPs 16.16.137.68, 16.16.245.117, 51.21.6.254
Redirect IPs 184.51.97.97, 2600:1413:1:484::2392, 2600:1413:1:497::2392
Response IP 184.51.97.97
Found Yes
Hash 16d4734a9972ba1f8b43a9736e94f12092d876f7bba502fa903b14c36b835ea5
SimHash 65085474c99b

Groups

*

Rule Path
Disallow /api
Disallow /asset-manifest.json
Allow /search/$
Disallow /search/
Disallow /home/search?q=

Other Records

Field Value
sitemap https://www.aljazeera.net/sitemap.xml
sitemap https://www.aljazeera.net/news-sitemap.xml
sitemap https://www.aljazeera.net/sitemaps/article-archive.xml
sitemap https://www.aljazeera.net/sitemaps/article-new.xml
sitemap https://www.aljazeera.net/sitemaps/video-archive.xml
sitemap https://www.aljazeera.net/sitemaps/video-new.xml