romepratique.com
robots.txt

Robots Exclusion Standard data for romepratique.com

Resource Scan

Scan Details

Site Domain romepratique.com
Base Domain romepratique.com
Scan Status Ok
Last Scan2024-09-27T22:14:56+00:00
Next Scan 2024-10-04T22:14:56+00:00

Last Scan

Scanned2024-09-27T22:14:56+00:00
URL https://romepratique.com/robots.txt
Domain IPs 78.40.8.65
Response IP 78.40.8.65
Found Yes
Hash 5265a741bde9483975bf95cf402d5b1c009ef55960bca0795a32bd2eef8dbb19
SimHash 7a33d14fabf3

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

textbulkerbot

Rule Path
Disallow /

*

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /Forums/
Disallow /page/
Disallow /tag/
Disallow /feed/
Disallow /?s=
Disallow /events/
Disallow /category/
Disallow /venue/
Disallow /rhc-past-events/
Disallow /rhc-upcoming-events/
Disallow /?event_rdate=
Disallow /?attachment_id=

Other Records

Field Value
sitemap https://www.romepratique.com/sitemap_index.xml