chat.cheriefm.fr
robots.txt

Robots Exclusion Standard data for chat.cheriefm.fr

Resource Scan

Scan Details

Site Domain chat.cheriefm.fr
Base Domain cheriefm.fr
Scan Status Ok
Last Scan2024-10-09T04:09:13+00:00
Next Scan 2024-10-16T04:09:13+00:00

Last Scan

Scanned2024-10-09T04:09:13+00:00
URL https://chat.cheriefm.fr/robots.txt
Domain IPs 185.40.101.46
Response IP 185.40.101.46
Found Yes
Hash c9273fcf4db101a4dec8175e381e605681233ffa9e33b8d3e10ffa2d04707dce
SimHash 415dc3102633

Groups

mj12bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

geekystats.com crawler

Rule Path
Disallow /

jugendschutzprogramm-crawler; info: http://www.jugendschutzprogramm.de

Rule Path
Disallow /

hybridbot (hybrid.ru/about. if our bot caused problems please contact us. contact email: m.lyashkov@targetix.net)

Rule Path
Disallow /

mozilla/5.0 (compatible; grapeshotcrawler/2.0; +http://www.grapeshot.co.uk/crawler.php)

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sirdatabot (+https://semantic-api.docs.sirdata.net/contextual-api/contextual-api/introduction)

Rule Path
Disallow /Account/Activate/

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://chat.cheriefm.fr/sitemap.xml