ma-planete.com
robots.txt

Robots Exclusion Standard data for ma-planete.com

Resource Scan

Scan Details

Site Domain ma-planete.com
Base Domain ma-planete.com
Scan Status Ok
Last Scan2024-09-22T19:48:13+00:00
Next Scan 2024-09-29T19:48:13+00:00

Last Scan

Scanned2024-09-22T19:48:13+00:00
URL https://ma-planete.com/robots.txt
Domain IPs 172.66.40.142, 172.66.43.114, 2606:4700:3108::ac42:288e, 2606:4700:3108::ac42:2b72
Response IP 172.66.40.142
Found Yes
Hash b6d2d51e324c21aebfa29d4bc2e30103aafafa9b8a92be55be8683fe47071cbe
SimHash ff4adb564975

Groups

*

Rule Path
Allow /file/style/
Allow /file/pic/
Disallow /file/
Allow /plugins/chameleon/
Allow /plugins/phpsns_shared/
Allow /plugins/forums/
Allow /plugins/wysiwyg/
Disallow /plugins/
Disallow /blagues/print/
Disallow /trucsetastuces/print/

Other Records

Field Value
crawl-delay 120

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

Other Records

Field Value
sitemap http://ma-planete.com/sitemap.xml