lecafedugeek.fr
robots.txt

Robots Exclusion Standard data for lecafedugeek.fr

Resource Scan

Scan Details

Site Domain lecafedugeek.fr
Base Domain lecafedugeek.fr
Scan Status Ok
Last Scan2024-09-26T16:29:03+00:00
Next Scan 2024-10-03T16:29:03+00:00

Last Scan

Scanned2024-09-26T16:29:03+00:00
URL https://lecafedugeek.fr/robots.txt
Domain IPs 144.91.108.93, 2a02:c207:2137:9858::1
Response IP 144.91.108.93
Found Yes
Hash c10df21130b6265bfba077eb0afef7f7efe210f0686c1a41c8d46ce4b1e8e083
SimHash e6030d00ab3a

Groups

*

Rule Path
Allow /ads/preferences/
Allow /gpt/
Allow /pagead/show_ads.js
Allow /pagead/js/adsbygoogle.js
Allow /pagead/js/*/show_ads_impl.js
Allow /static/glade.js
Allow /static/glade/
Allow /codes-promo/commentseruiner.com
Allow /ahrefs_f5fe650d41fa91a887f2d60eee49bb0654bcb5813d30c46da727e2c827c34467
Disallow /wp-admin/
Disallow /cafelogin
Disallow */trackback
Disallow /*/comments/
Disallow /cgi-bin
Disallow /*.php$
Disallow /.*inc$
Disallow /*.gz$
Disallow /*.cgi$

Other Records

Field Value
sitemap https://lecafedugeek.fr/news-sitemap.xml
sitemap https://lecafedugeek.fr/sitemap_index.xml