punchfork.com
robots.txt

Robots Exclusion Standard data for punchfork.com

Resource Scan

Scan Details

Site Domain punchfork.com
Base Domain punchfork.com
Scan Status Ok
Last Scan2024-10-02T09:45:17+00:00
Next Scan 2024-11-01T09:45:17+00:00

Last Scan

Scanned2024-10-02T09:45:17+00:00
URL https://punchfork.com/robots.txt
Redirect https://www.punchfork.com/robots.txt
Redirect Domain www.punchfork.com
Redirect Base punchfork.com
Domain IPs 3.16.87.35
Redirect IPs 3.16.87.35
Response IP 3.16.87.35
Found Yes
Hash 74d720850de44714134063d9450a767cf75bd1cb00a3e22bd668cca9197efb8d
SimHash 431f9cd2ea9c

Groups

*

Rule Path
Disallow /r/
Disallow /unsubscribe
Disallow /ocu
Disallow /wiring/
Disallow /*/totaltime/
Disallow /*/new/diet/
Disallow /*/top/diet/
Disallow /*/trending/diet/

bingbot

Rule Path
Disallow /r/
Disallow /unsubscribe
Disallow /ocu
Disallow /wiring/
Disallow /*/totaltime/
Disallow /*/new/diet/
Disallow /*/top/diet/
Disallow /*/trending/diet/

Other Records

Field Value
crawl-delay 1

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

mixrankbot

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

garlik

Rule Path
Disallow /

mtrobot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

ezoicbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

seostar

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

brightedge crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

qwantbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

Other Records

Field Value
sitemap https://www.punchfork.com/sitemap.xml