usinenouvelle.com
robots.txt

Robots Exclusion Standard data for usinenouvelle.com

Resource Scan

Scan Details

Site Domain usinenouvelle.com
Base Domain usinenouvelle.com
Scan Status Ok
Last Scan2024-11-19T09:03:13+00:00
Next Scan 2024-11-26T09:03:13+00:00

Last Scan

Scanned2024-11-19T09:03:13+00:00
URL https://www.usinenouvelle.com/robots.txt
Domain IPs 104.16.68.100, 104.16.69.100
Response IP 104.16.68.100
Found Yes
Hash c43d1a8dff52bfd8e28f26c9d37f86b591bf54aaa9803e4d250d4c8719e99d52
SimHash fb3a4d94ab90

Groups

*

Rule Path
Disallow /usn
Disallow /notule
Disallow /recherche%3D*
Disallow /*.pdf$
Disallow /*.php$
Disallow /expo/?p=11
Disallow /expo/?p=60
Disallow /expo/?p=15
Disallow /expo/tpme-o3976.html
Disallow /expo/le-magasin-o4622.html
Disallow /expo/tpme-e3976.html
Disallow /expo/le-magasin-e4622.html
Disallow /essai
Disallow /essai-confirmation
Disallow /catalogue/
Disallow /*.cfm*
Disallow /expo/?p=22
Disallow /expo/?p=02
Disallow /expo/?p=2
Disallow /mon-compte/*
Disallow /expo/?p=20
Disallow /expo/?p=30
Disallow /expo/js/data_map.*
Disallow /mein-konto/
Disallow /my-account/
Disallow /expo/ajax.php?*
Disallow /*_product_list_item.jpg
Disallow /expo/js/infoprodata.js
Disallow /preview%3Dtrue
Disallow /expo/ajax.php?c=*
Disallow /expo/?p=90*
Disallow /*/1000

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.usinenouvelle.com/sitemaps/index.xml
sitemap https://www.usinenouvelle.com/sitemaps/edito.xml
sitemap https://www.usinenouvelle.com/sitemaps/parution_3/news.xml
sitemap https://www.usinenouvelle.com/sitemaps/parution_4/news.xml
sitemap https://www.usinenouvelle.com/sitemaps/parution_5/news.xml
sitemap https://www.usinenouvelle.com/sitemaps/parution_6/news.xml
sitemap https://www.usinenouvelle.com/sitemaps/parution_8/news.xml
sitemap https://www.usinenouvelle.com/sitemaps/parution_9/news.xml
sitemap https://www.usinenouvelle.com/sitemaps/parution_10/news.xml