nouveauxplaisirs.fr
robots.txt

Robots Exclusion Standard data for nouveauxplaisirs.fr

Resource Scan

Scan Details

Site Domain nouveauxplaisirs.fr
Base Domain nouveauxplaisirs.fr
Scan Status Ok
Last Scan2024-09-24T16:35:57+00:00
Next Scan 2024-10-24T16:35:57+00:00

Last Scan

Scanned2024-09-24T16:35:57+00:00
URL https://nouveauxplaisirs.fr/robots.txt
Domain IPs 109.234.166.179
Response IP 109.234.166.179
Found Yes
Hash 7d0157427cb6ef8f5e3730f3ef386b88c3f5c1445fdbf2720052911c365a664d
SimHash 8e15d401bdf2

Groups

*

Rule Path
Disallow /wp-json/
Disallow /wp-admin
Disallow /emergency
Disallow /tools
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.zip$
Disallow /*.log$
Disallow /*.tmp$
Disallow /*.csv$
Disallow /wp-login.php
Disallow /readme.html
Disallow /?s=*
Disallow /search/*

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nouveauxplaisirs.fr/sitemap_index.xml

Comments

  • Global rules
  • -----------------
  • We're experimenting with blocking search results to prevent search result spam
  • Ban bots that don't benefit us.
  • --------------------------------
  • Sitemap
  • -----------------