chat.rireetchansons.fr
robots.txt

Robots Exclusion Standard data for chat.rireetchansons.fr

Resource Scan

Scan Details

Site Domain chat.rireetchansons.fr
Base Domain rireetchansons.fr
Scan Status Ok
Last Scan2024-10-09T23:05:00+00:00
Next Scan 2024-10-16T23:05:00+00:00

Last Scan

Scanned2024-10-09T23:05:00+00:00
URL https://chat.rireetchansons.fr/robots.txt
Domain IPs 185.40.101.46
Response IP 185.40.101.46
Found Yes
Hash 03da0c713b86488b21557e1cf469c389016c71f4eef0cfdc7ed5f34b2778edc0
SimHash 615dc3144673

Groups

mj12bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

geekystats.com crawler

Rule Path
Disallow /

jugendschutzprogramm-crawler; info: http://www.jugendschutzprogramm.de

Rule Path
Disallow /

hybridbot (hybrid.ru/about. if our bot caused problems please contact us. contact email: m.lyashkov@targetix.net)

Rule Path
Disallow /

mozilla/5.0 (compatible; grapeshotcrawler/2.0; +http://www.grapeshot.co.uk/crawler.php)

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sirdatabot (+https://semantic-api.docs.sirdata.net/contextual-api/contextual-api/introduction)

Rule Path
Disallow /Account/Activate/

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://chat.rireetchansons.fr/sitemap.xml