doitinparis.com
robots.txt

Robots Exclusion Standard data for doitinparis.com

Resource Scan

Scan Details

Site Domain doitinparis.com
Base Domain doitinparis.com
Scan Status Ok
Last Scan 2024-11-24T08:41:06+00:00
Next Scan 2024-12-24T08:41:06+00:00

Last Scan

Scanned 2024-11-24T08:41:06+00:00
URL https://doitinparis.com/robots.txt
Redirect https://www.doitinparis.com/robots.txt
Redirect Domain www.doitinparis.com
Redirect Base doitinparis.com
Domain IPs 57.128.74.207
Redirect IPs 57.128.74.207
Response IP 57.128.74.207
Found Yes
Hash f9b103e38b959301600eea92ba267d09e90efca3f01b69dcb55755a42f45859d
SimHash c8311ef0a9b3
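
The report does not say which algorithm produced the Hash value. Its length (64 hexadecimal characters) is consistent with a SHA-256 digest, so the following is only a sketch under that assumption, showing how a fetched robots.txt body could be fingerprinted to detect changes between scans. The function name `fingerprint` and the sample bytes are hypothetical, and the SimHash field is not covered here.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 is used; the scan report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file content.
sample = b"User-agent: *\nDisallow: /dev/\n"
digest = fingerprint(sample)

# Any byte-level change yields a different digest, which is what makes
# the value usable for change detection between two scans.
changed = fingerprint(sample + b"Disallow: /popup/\n")
print(len(digest), digest != changed)
```

Comparing the digest stored from the previous scan with a freshly computed one is enough to tell whether the file changed, though not what changed.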

Groups

*

Rule Path
Disallow /carrousel/
Disallow /popup/
Disallow /prev.php/
Disallow /dev/
Disallow rubriqueID%3D*
Disallow module%3D*
Disallow action%3D*
Disallow dataID%3D*
Disallow articleID%3D*
Disallow /*?print=*
Disallow /*?prev=*
Disallow /fr/search
Disallow /fr/inscription-newsletter.html
Disallow /fr/desinscription-newsletter.html
Disallow /fr/calendrier/avent-*.html

oai-searchbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

gptbot

Rule Path
Allow /
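
The groups above can be exercised with Python's standard `urllib.robotparser`. This is a minimal sketch, not a full reproduction of the file: it includes only a few of the plain prefix rules from the `*` group plus the `gptbot` group, and leaves out the wildcard rules (such as `/*?print=*`) because the sketch relies on simple prefix matching only. The user agent name `ExampleBot` and the tested paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# A subset of the rules listed in the scan, written back in robots.txt syntax.
ROBOTS_LINES = """\
User-agent: *
Disallow: /carrousel/
Disallow: /popup/
Disallow: /dev/
Disallow: /fr/search

User-agent: gptbot
Allow: /
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

BASE = "https://www.doitinparis.com"

# A crawler without its own group falls back to the "*" group.
generic_dev = parser.can_fetch("ExampleBot", BASE + "/dev/index.html")
generic_home = parser.can_fetch("ExampleBot", BASE + "/fr/")

# gptbot has a dedicated group with "Allow: /", so the "*" rules do not apply.
gptbot_dev = parser.can_fetch("GPTBot", BASE + "/dev/index.html")

print(generic_dev, generic_home, gptbot_dev)
```

The point of the example is group selection: a crawler that matches a named group is evaluated against that group alone, so the Disallow rules under `*` do not restrict `gptbot`.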

Other Records

Field Value
sitemap https://www.doitinparis.com/sitemap/sitemap.xml
sitemap https://www.doitinparis.com/sitemap/sitemap-news.xml
sitemap https://www.doitinparis.com/sitemap/sitemap-authors.xml
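
The sitemap records can be read programmatically as well. The sketch below assumes Python 3.8 or later, where `RobotFileParser.site_maps()` is available, and rebuilds the three records in robots.txt syntax rather than fetching the live file.

```python
from urllib.robotparser import RobotFileParser

# The three sitemap records from the scan, in robots.txt syntax.
SITEMAP_LINES = [
    "Sitemap: https://www.doitinparis.com/sitemap/sitemap.xml",
    "Sitemap: https://www.doitinparis.com/sitemap/sitemap-news.xml",
    "Sitemap: https://www.doitinparis.com/sitemap/sitemap-authors.xml",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns a list of URLs, or None when no record is present.
sitemaps = sitemap_parser.site_maps() or []
for url in sitemaps:
    print(url)
```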