muckrack.com
robots.txt

Robots Exclusion Standard data for muckrack.com

Resource Scan

Scan Details

Site Domain muckrack.com
Base Domain muckrack.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-09-01T05:18:05+00:00
Next Scan 2025-11-30T05:18:05+00:00

Last Successful Scan

Scanned 2023-10-19T07:31:43+00:00
URL https://muckrack.com/robots.txt
Domain IPs 104.18.12.41, 104.18.13.41, 2606:4700::6812:c29, 2606:4700::6812:d29
Response IP 104.18.13.41
Found Yes
Hash ad203fc390e94b731293da4394ae61fb3fac0a369e81afe8d2ad614319c0aef5
SimHash 8b87c066b112

Groups

*

Rule Path
Disallow *.atom$
Disallow *.json$
Disallow /search/*
Disallow /ajax/*
Disallow */statuses/*
Disallow /*?*q=*
Disallow /*?*next=*
Disallow /*?url=*
Disallow /feeds/*
Disallow /*?*page=*
Disallow /link/*
Disallow /account/*
Disallow /styleguide/*
Disallow /trending/*
Allow /humans.txt
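
Several of the Disallow paths in this group use the `*` wildcard and the `$` end anchor. As a rough illustration of how such patterns could be evaluated, here is a minimal sketch that assumes the matching semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the URL, and patterns match from the start of the path). The function names are hypothetical, and the pattern list is simply copied from the table above. Python's standard `urllib.robotparser` does not interpret these wildcards, which is why the sketch builds its own regular expressions.

```python
import re
from typing import Sequence

# Disallow patterns copied from the "*" group above.
STAR_GROUP_DISALLOW = [
    "*.atom$", "*.json$", "/search/*", "/ajax/*", "*/statuses/*",
    "/*?*q=*", "/*?*next=*", "/*?url=*", "/feeds/*", "/*?*page=*",
    "/link/*", "/account/*", "/styleguide/*", "/trending/*",
]

def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics (RFC 9309): '*' matches any run of characters,
    a trailing '$' anchors the end of the URL, all else is literal.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape each literal piece, then rejoin the pieces with '.*'.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))

def matches_any(path: str, patterns: Sequence[str]) -> bool:
    """Return True if the path (including any query string) matches a pattern."""
    return any(robots_pattern_to_regex(p).match(path) for p in patterns)
```

Under these assumptions, `matches_any("/blog?page=2", STAR_GROUP_DISALLOW)` is true because of `/*?*page=*`, while `matches_any("/data.json?x=1", STAR_GROUP_DISALLOW)` is false: the `$` in `*.json$` requires the URL to end in `.json`.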

dotbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

buck

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
Allow /about
Allow /blog/
Allow /case-studies
Allow /collaboration
Allow /daily
Allow /events
Allow /guides
Allow /in-the-news
Allow /journalists
Allow /media-database
Allow /media-monitoring-alerts
Allow /media-pitching-personalized-outreach
Allow /overview
Allow /pr-analytics-reporting-measurement-software
Allow /pricing
Allow /research
Allow /webinars
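
The gptbot group combines a blanket `Disallow /` with a list of Allow paths. A crawler following RFC 9309 would be expected to resolve this by picking the longest matching rule, with Allow winning a tie. The sketch below illustrates that precedence under those assumed semantics; `is_allowed` is a hypothetical helper, and the rules are plain prefixes copied from the table above (none of them use wildcards).

```python
from typing import List, Tuple

# Rules copied from the gptbot group above, as (kind, path prefix) pairs.
GPTBOT_RULES: List[Tuple[str, str]] = [
    ("disallow", "/"),
    ("allow", "/about"),
    ("allow", "/blog/"),
    ("allow", "/case-studies"),
    ("allow", "/collaboration"),
    ("allow", "/daily"),
    ("allow", "/events"),
    ("allow", "/guides"),
    ("allow", "/in-the-news"),
    ("allow", "/journalists"),
    ("allow", "/media-database"),
    ("allow", "/media-monitoring-alerts"),
    ("allow", "/media-pitching-personalized-outreach"),
    ("allow", "/overview"),
    ("allow", "/pr-analytics-reporting-measurement-software"),
    ("allow", "/pricing"),
    ("allow", "/research"),
    ("allow", "/webinars"),
]

def is_allowed(path: str, rules: List[Tuple[str, str]]) -> bool:
    """Apply longest-match precedence; Allow wins when lengths tie.

    A path that matches no rule is treated as allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

With these rules, `is_allowed("/pricing", GPTBOT_RULES)` and `is_allowed("/blog/some-post", GPTBOT_RULES)` are true, while `is_allowed("/blog", GPTBOT_RULES)` is false: the Allow entry is `/blog/` with a trailing slash, so only `Disallow /` matches the bare `/blog` path.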

Other Records

Field Value
sitemap https://muckrack.com/sitemap.xml

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449