ucmsajmproxyeastus.appservicesenveastus.p.azurewebsites.net
robots.txt

Resource Scan

Scan Details

Site Domain ucmsajmproxyeastus.appservicesenveastus.p.azurewebsites.net
Base Domain p.azurewebsites.net
Scan Status Ok
Last Scan 2025-06-17T00:46:03+00:00
Next Scan 2025-07-01T00:46:03+00:00

Last Scan

Scanned 2025-06-17T00:46:03+00:00
URL https://ucmsajmproxyeastus.appservicesenveastus.p.azurewebsites.net/robots.txt
Domain IPs 20.121.92.252
Response IP 20.121.92.252
Found Yes
Hash 2dc62d987c2ff9147d02d2d6091faf32d42d5bbebbe479e1b3ced332ee2dd2b7
SimHash 7148157fe733
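
The Hash field is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, a freshly fetched robots.txt body could be compared against the recorded value as follows. The function names `sha256_hex` and `body_unchanged` are hypothetical and are not part of the scanner.

```python
import hashlib

# Value copied from the "Hash" field of the scan report.
RECORDED_HASH = "2dc62d987c2ff9147d02d2d6091faf32d42d5bbebbe479e1b3ced332ee2dd2b7"

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()

def body_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value.

    Assumption: the scanner hashed the unmodified response bytes with SHA-256.
    If it normalised line endings or whitespace first, this check will not match.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch would only indicate that the bytes differ under this assumption; it would not by itself show that any rule changed.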

Groups

*

Rule Path
Disallow /api
Disallow /asset-manifest.json
Allow /search/$
Disallow /search/
Disallow /home/search?q=
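
The pair `Allow /search/$` and `Disallow /search/` only makes sense under longest-match precedence, where `$` anchors the end of the path: the bare `/search/` page stays crawlable while everything beneath it is blocked. The following is a minimal sketch of that evaluation, assuming the matching rules described in RFC 9309 (longest pattern wins, Allow wins a tie, `*` is a wildcard). The helper names `pattern_to_regex` and `is_allowed` are hypothetical. Python's built-in `urllib.robotparser` is deliberately not used here, since it does not interpret `$` or `*` this way.

```python
import re

# Rules from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/api"),
    ("disallow", "/asset-manifest.json"),
    ("allow", "/search/$"),
    ("disallow", "/search/"),
    ("disallow", "/home/search?q="),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal text, then join the pieces with ".*" where "*" appeared.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/search/")` is true because the 9-character Allow pattern outranks the 8-character Disallow, while `is_allowed("/search/foo")` is false because only the Disallow matches.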

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /
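
The named groups above each block one crawler entirely, while every other crawler falls through to the `*` group. As a rough sketch of that selection step, assuming a crawler compares its own product token against the group names case-insensitively and falls back to `*` when nothing matches, the lookup could look like this. `select_group` and `rules_for` are hypothetical names, and the data is transcribed from the groups listed in this report.

```python
# Crawlers that have their own group in this robots.txt.
AI_CRAWLERS = [
    "anthropic-ai", "chatgpt-user", "claudebot", "claude-web",
    "cohere-ai", "gptbot", "perplexitybot", "bytespider",
]

# Each named group carries a single "Disallow /" rule.
GROUPS = {name: [("disallow", "/")] for name in AI_CRAWLERS}

# The catch-all group applies to every crawler not named above.
GROUPS["*"] = [
    ("disallow", "/api"),
    ("disallow", "/asset-manifest.json"),
    ("allow", "/search/$"),
    ("disallow", "/search/"),
    ("disallow", "/home/search?q="),
]

def select_group(product_token, groups=GROUPS):
    """Return the name of the group that applies to a crawler's product token."""
    key = product_token.strip().lower()
    return key if key in groups else "*"

def rules_for(product_token, groups=GROUPS):
    """Return the rule list a crawler with this product token should obey."""
    return groups[select_group(product_token, groups)]
```

For example, `rules_for("GPTBot")` yields the single `Disallow /` rule, whereas a token with no group of its own, such as `Googlebot`, receives the five rules of the `*` group.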

Other Records

Field Value
sitemap https://www.ajmubasher.net/sitemap.xml
sitemap https://www.ajmubasher.net/news-sitemap.xml
sitemap https://www.ajmubasher.net/sitemaps/article-archive.xml
sitemap https://www.ajmubasher.net/sitemaps/article-new.xml
sitemap https://www.ajmubasher.net/sitemaps/video-archive.xml
sitemap https://www.ajmubasher.net/sitemaps/video-new.xml

Comments

  • Al Jazeera Media Network content is made available for your personal, non-commercial use subject to our Terms and Conditions: https://www.aljazeera.com/terms-and-conditions/
  • Any other uses are not permitted, including but not limited to:
  • (1) the development of any software, machine learning, artificial intelligence (AI), and/or large language models (LLMs);
  • (2) text and data mining activities;
  • (3) creating or providing archived or cached data sets containing our content to others; and/or
  • (4) any commercial purposes.
  • Use of any device, tool, or process designed to data mine or scrape the content using automated means is prohibited without prior written permission from Al Jazeera Media Network. Contact https://network.aljazeera.net/en/contact for assistance.
  • Disallow Rules
  • Sitemaps