marlincrawler.com
robots.txt

Robots Exclusion Standard data for marlincrawler.com

Resource Scan

Scan Details

Site Domain marlincrawler.com
Base Domain marlincrawler.com
Scan Status Ok
Last Scan2025-10-22T10:21:31+00:00
Next Scan 2025-11-21T10:21:31+00:00

Last Scan

Scanned2025-10-22T10:21:31+00:00
URL https://marlincrawler.com/robots.txt
Domain IPs 104.21.4.95, 172.67.153.246, 2606:4700:3032::ac43:99f6, 2606:4700:3034::6815:45f
Response IP 104.21.4.95
Found Yes
Hash 322a0d51a27109b5d03c0c9f7eb1011b8619819ed0a85d21ea4b4294c7b23572
SimHash e92059686692

Groups

*

Rule Path
Disallow /*add-to-cart%3D*
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /?s=
Disallow /search/
Disallow /log-in?
Disallow /wp-login.php
Disallow /wp-signup.php