eurecat.org
robots.txt

Robots Exclusion Standard data for eurecat.org

Resource Scan

Scan Details

Site Domain eurecat.org
Base Domain eurecat.org
Scan Status Ok
Last Scan2024-09-17T13:48:16+00:00
Next Scan 2024-10-17T13:48:16+00:00

Last Scan

Scanned2024-09-17T13:48:16+00:00
URL https://eurecat.org/robots.txt
Domain IPs 75.102.57.201
Response IP 75.102.57.201
Found Yes
Hash f5bf56241fee5d902340735624f9c78fdaaa71c1126e3830972a10bd0d5a9fa5
SimHash db4bfa40cd65

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

*

Rule Path
Disallow /*add-to-cart%3D*

*

Rule Path
Disallow /*blackhole
Disallow /?blackhole

*

Rule Path
Disallow /wp-content/uploads/wp-import-export-lite/

Comments

  • WP Import Export Rule