monacomatin.mc
robots.txt

Robots Exclusion Standard data for monacomatin.mc

Resource Scan

Scan Details

Site Domain monacomatin.mc
Base Domain monacomatin.mc
Scan Status Ok
Last Scan2024-11-11T20:48:31+00:00
Next Scan 2024-11-18T20:48:31+00:00

Last Scan

Scanned2024-11-11T20:48:31+00:00
URL https://monacomatin.mc/robots.txt
Redirect https://www.monacomatin.mc/robots.txt
Redirect Domain www.monacomatin.mc
Redirect Base monacomatin.mc
Domain IPs 80.94.98.229, 80.94.98.231
Redirect IPs 80.94.98.229, 80.94.98.231
Response IP 80.94.98.231
Found Yes
Hash 3337206b3e34810c94876bbdef5ac24e57fb360cd1cac57411564b6b19736438
SimHash 0a86587509a3

Groups

*

Rule Path
Disallow /recherche?search=*
Disallow /oa
Disallow /user*
Disallow /a/
Disallow /edition-du-jour/lire
Disallow /auth/
Disallow /index.php/*
Disallow /*/get-token*
Disallow /*/oaToken/*
Disallow /newspapers/read/*
Disallow /carnet-avis-deces*

Other Records

Field Value
crawl-delay 10

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.monacomatin.mc/sitemap.xml
sitemap https://www.monacomatin.mc/googlenews.xml

Comments

  • Sitemap