muckrack.com
robots.txt
Robots Exclusion Standard data for muckrack.com
Resource Scan
Scan Details
Site Domain | muckrack.com |
Base Domain | muckrack.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-09-01T05:18:05+00:00 |
Next Scan | 2025-11-30T05:18:05+00:00 |
Last Successful Scan
Scanned | 2023-10-19T07:31:43+00:00 |
URL | https://muckrack.com/robots.txt |
Domain IPs | 104.18.12.41, 104.18.13.41, 2606:4700::6812:c29, 2606:4700::6812:d29 |
Response IP | 104.18.13.41 |
Found | Yes |
Hash | ad203fc390e94b731293da4394ae61fb3fac0a369e81afe8d2ad614319c0aef5 |
SimHash | 8b87c066b112 |
Groups
*
Rule | Path |
---|---|
Disallow | *.atom$ |
Disallow | *.json$ |
Disallow | /search/* |
Disallow | /ajax/* |
Disallow | */statuses/* |
Disallow | /*?*q=* |
Disallow | /*?*next=* |
Disallow | /*?url=* |
Disallow | /feeds/* |
Disallow | /*?*page=* |
Disallow | /link/* |
Disallow | /account/* |
Disallow | /styleguide/* |
Disallow | /trending/* |
Allow | /humans.txt |
gptbot
Rule | Path |
---|---|
Disallow | / |
Allow | /about |
Allow | /blog/ |
Allow | /case-studies |
Allow | /collaboration |
Allow | /daily |
Allow | /events |
Allow | /guides |
Allow | /in-the-news |
Allow | /journalists |
Allow | /media-database |
Allow | /media-monitoring-alerts |
Allow | /media-pitching-personalized-outreach |
Allow | /overview |
Allow | /pr-analytics-reporting-measurement-software |
Allow | /pricing |
Allow | /research |
Allow | /webinars |
Other Records
Field | Value |
---|---|
sitemap | https://muckrack.com/sitemap.xml |
Comments