myportail.com
robots.txt

Robots Exclusion Standard data for myportail.com

Resource Scan

Scan Details

Site Domain myportail.com
Base Domain myportail.com
Scan Status Ok
Last Scan2026-01-31T10:04:42+00:00
Next Scan 2026-02-07T10:04:42+00:00

Last Scan

Scanned2026-01-31T10:04:42+00:00
URL https://myportail.com/robots.txt
Domain IPs 104.21.28.99, 172.67.145.210, 2606:4700:3033::6815:1c63, 2606:4700:3035::ac43:91d2
Response IP 104.21.28.99
Found Yes
Hash 45a693bd451f7355ebe9e4d6bbd1a2a32ae3d65de9a74e71b0812d186f475c83
SimHash 21205b0007d9

Groups

*

Rule Path
Allow /
Disallow /katib.php
Disallow /author-config.php
Disallow /author-dashboard.php
Disallow /author-login.php
Disallow /author-logout.php
Disallow /author-register-existing.php
Disallow /add_article_modif.php
Disallow /add_article.php
Disallow /admin-articles.php
Disallow /admin-authors.php
Disallow /admin-classification-themes.php
Disallow /admin-classification.php
Disallow /admin-responses.php

*

Rule Path
Disallow /auteur/*/article/
Disallow /auteur/*/
Allow /article.php
Allow /article-auteur.php
Allow /articles-faouzi-messeoud.php

Other Records

Field Value
sitemap https://www.myportail.com/sitemap.xml

Comments

  • robots.txt for myportail.com
  • Block subscriber-only pages
  • Block problematic URL patterns
  • Allow clean URLs
  • Sitemap location