flyingpenguin.com
robots.txt

Robots Exclusion Standard data for flyingpenguin.com

Resource Scan

Scan Details

Site Domain flyingpenguin.com
Base Domain flyingpenguin.com
Scan Status Ok
Last Scan2025-08-29T15:09:11+00:00
Next Scan 2025-09-28T15:09:11+00:00

Last Scan

Scanned2025-08-29T15:09:11+00:00
URL https://flyingpenguin.com/robots.txt
Domain IPs 208.66.129.140
Response IP 208.66.129.140
Found Yes
Hash 90dfcad8c83b8f873c1a2d0dfd7282476148de11156ddbca8d4cb7c2e38635fc
SimHash 430c480065f4

Groups

*

Rule Path
Disallow /*?paged=
Disallow /wp-admin/
Disallow /*blackhole
Disallow /?blackhole
Allow /wp-admin/admin-ajax.php

scrapy

Rule Path
Disallow /

scrapybot

Rule Path
Disallow /

wget

Rule Path
Disallow /

curl

Rule Path
Disallow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

Comments

  • Block common scraping bots
  • Allow good bots