chat.nrj.fr
robots.txt

Robots Exclusion Standard data for chat.nrj.fr

Resource Scan

Scan Details

Site Domain chat.nrj.fr
Base Domain nrj.fr
Scan Status Ok
Last Scan2024-05-25T00:54:52+00:00
Next Scan 2024-06-01T00:54:52+00:00

Last Scan

Scanned2024-05-25T00:54:52+00:00
URL https://chat.nrj.fr/robots.txt
Domain IPs 185.40.101.46
Response IP 185.40.101.46
Found Yes
Hash 4e8bd55d10cd9c56555a54bd72c3f8409e56ea75d3890067da52328b88f2bb84
SimHash 4154c3140713

Groups

mj12bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

geekystats.com crawler

Rule Path
Disallow /

jugendschutzprogramm-crawler; info: http://www.jugendschutzprogramm.de

Rule Path
Disallow /

hybridbot (hybrid.ru/about. if our bot caused problems please contact us. contact email: m.lyashkov@targetix.net)

Rule Path
Disallow /

mozilla/5.0 (compatible; grapeshotcrawler/2.0; +http://www.grapeshot.co.uk/crawler.php)

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sirdatabot (+https://semantic-api.docs.sirdata.net/contextual-api/contextual-api/introduction)

Rule Path
Disallow /Account/Activate/

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://chat.nrj.fr/sitemap.xml