www2.lactualite.com
robots.txt

Robots Exclusion Standard data for www2.lactualite.com

Resource Scan

Scan Details

Site Domain www2.lactualite.com
Base Domain lactualite.com
Scan Status Ok
Last Scan2024-05-21T09:10:50+00:00
Next Scan 2024-06-20T09:10:50+00:00

Last Scan

Scanned2024-05-21T09:10:50+00:00
URL https://www2.lactualite.com/robots.txt
Domain IPs 104.26.6.65, 104.26.7.65, 172.67.74.4, 2606:4700:20::681a:641, 2606:4700:20::681a:741, 2606:4700:20::ac43:4a04
Response IP 172.67.74.4
Found Yes
Hash 1669694490aea5af58ec6725735ba33e27a0760fcf41d05f9a8aa7e6981d1936
SimHash 21041df74582

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-login.php
Allow /wp-admin/admin-ajax.php
Disallow */trackback
Disallow /*/feed
Disallow /*/comments
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz
Disallow /*.cgi
Disallow /auteurs/*/page/

ahrefsbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://lactualite.com/sitemap.xml
sitemap https://lactualite.com/post_google_news.xml