mynw.com
robots.txt

Robots Exclusion Standard data for mynw.com

Resource Scan

Scan Details

Site Domain mynw.com
Base Domain mynw.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-12-13T09:49:34+00:00
Next Scan 2025-12-20T09:49:34+00:00

Last Successful Scan

Scanned 2025-12-05T02:12:27+00:00
URL https://mynw.com/robots.txt
Domain IPs 162.159.134.42
Response IP 162.159.134.42
Found Yes
Hash b4b30638e9368289989d2fa481a66217890a4f35a3d98c9d8aa1dc1c79c566b7
SimHash 660a5950c0a0

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /wp-admin/
Disallow /*/feed
Allow /

Other Records

Field Value
crawl-delay 15
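
The three rules in the `*` group overlap, since `Allow /` matches every path. The sketch below shows one way a crawler might resolve them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data. The non-standard `crawl-delay` record is not modeled, and real crawlers may behave differently.

```python
import re

# Rules of the "*" group, copied from the listing above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/*/feed"),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict


if __name__ == "__main__":
    for p in ["/wp-admin/options.php", "/news/feed", "/feed", "/news/story"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/wp-admin/options.php` and `/news/feed` are blocked because the Disallow patterns are longer than `/`, while `/feed` and `/news/story` fall through to `Allow /`.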

facebookexternalhit

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

nicecrawler

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

primalbot

Rule Path
Disallow /

quora-bot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://mynorthwest.com/news-sitemap.xml
sitemap https://mynorthwest.com/sitemap_index.xml

Comments

  • Disallow Rules