matthewzhaocc.com
robots.txt

Robots Exclusion Standard data for matthewzhaocc.com

Resource Scan

Scan Details

Site Domain matthewzhaocc.com
Base Domain matthewzhaocc.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-11-21T13:42:49+00:00
Next Scan 2025-11-28T13:42:49+00:00

Last Successful Scan

Scanned 2025-11-06T12:53:48+00:00
URL https://matthewzhaocc.com/robots.txt
Redirect https://matthewzhaocc.com/robots.txt?gi=186176c2fc56
Domain IPs 162.159.152.4, 162.159.153.4
Response IP 162.159.153.4
Found Yes
Hash 2475d1bfa00d83ec238387da45b16c2973295931f43ca019608615cd559bd71d
SimHash 693cbbc44372

Groups

*

Rule Path
Disallow /m/
Disallow /me/
Disallow /%40me$
Disallow /%40me/
Disallow /*/edit$
Disallow /*/*/edit$
Disallow /media/
Disallow /p/*/share
Disallow /r/
Disallow /trending
Disallow /search?q$
Disallow /search?q=
Disallow /*/search?q=
Disallow /*/search/*?q=
Disallow /*/*source%3D
Allow /_/api/users/*/meta
Allow /_/api/users/*/profile/stream
Allow /_/api/posts/*/responses
Allow /_/api/posts/*/responsesStream
Allow /_/api/posts/*/related
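
The rules above rely on two pattern features: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the path. The following is a minimal sketch of how a crawler might evaluate them, assuming the common longest-match convention (the most specific matching rule wins, and Allow wins a tie). The names pattern_to_regex, is_allowed and STAR_GROUP are hypothetical, and the sketch compares paths literally without normalizing percent-encoding, so `/%40me` and `/@me` are treated as different paths here.

```python
import re

# Rules copied from the "*" group table above, as (kind, path pattern) pairs.
STAR_GROUP = [
    ("disallow", "/m/"), ("disallow", "/me/"),
    ("disallow", "/%40me$"), ("disallow", "/%40me/"),
    ("disallow", "/*/edit$"), ("disallow", "/*/*/edit$"),
    ("disallow", "/media/"), ("disallow", "/p/*/share"),
    ("disallow", "/r/"), ("disallow", "/trending"),
    ("disallow", "/search?q$"), ("disallow", "/search?q="),
    ("disallow", "/*/search?q="), ("disallow", "/*/search/*?q="),
    ("disallow", "/*/*source%3D"),
    ("allow", "/_/api/users/*/meta"),
    ("allow", "/_/api/users/*/profile/stream"),
    ("allow", "/_/api/posts/*/responses"),
    ("allow", "/_/api/posts/*/responsesStream"),
    ("allow", "/_/api/posts/*/related"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    # Without "$" the pattern is a prefix match, so no end anchor is added.
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Return True if `path` may be fetched under `rules`."""
    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) wins.
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, a call such as is_allowed(STAR_GROUP, "/some-post/edit") is rejected by `/*/edit$`, while a path that matches no rule at all falls through to the default of allowed.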

amazonbot
applebot-extended
bytespider
claudebot
facebookbot
googleother
gptbot
meta-externalagent

Rule Path
Disallow /
Allow /about
Allow /business
Allow /earn
Allow /gift
Allow /membership
Allow /partner-program
Allow /verified-authors
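
The group above blocks the listed crawlers from everything except a handful of informational paths. As a rough sketch of how a crawler could pick its group and apply it, assuming case-insensitive substring matching on the user-agent string (a simplification of product-token matching) and the longest-match convention: the names NAMED_BOTS, NAMED_BOT_RULES, STAR_RULES_SUBSET, select_rules and is_allowed are hypothetical, and STAR_RULES_SUBSET deliberately contains only a few wildcard-free rules from the `*` group so that plain prefix matching is enough.

```python
# User agents and rules copied from the group table above.
NAMED_BOTS = (
    "amazonbot", "applebot-extended", "bytespider", "claudebot",
    "facebookbot", "googleother", "gptbot", "meta-externalagent",
)
NAMED_BOT_RULES = [
    ("disallow", "/"),
    ("allow", "/about"), ("allow", "/business"), ("allow", "/earn"),
    ("allow", "/gift"), ("allow", "/membership"),
    ("allow", "/partner-program"), ("allow", "/verified-authors"),
]

# Assumption: an abbreviated, wildcard-free subset of the "*" group,
# used only as the fallback for crawlers not named above.
STAR_RULES_SUBSET = [
    ("disallow", "/m/"), ("disallow", "/media/"), ("disallow", "/trending"),
]


def select_rules(user_agent: str):
    """Pick the named-bot group if the agent matches, else the fallback."""
    agent = user_agent.lower()
    if any(bot in agent for bot in NAMED_BOTS):
        return NAMED_BOT_RULES
    return STAR_RULES_SUBSET


def is_allowed(user_agent: str, path: str) -> bool:
    """Longest matching prefix wins; Allow wins a tie; default is allowed."""
    best = None  # (prefix length, is_allow)
    for kind, prefix in select_rules(user_agent):
        if path.startswith(prefix):
            candidate = (len(prefix), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

With this sketch, an agent string containing "GPTBot" may fetch /about (the longer Allow rule outranks `Disallow /`) but not an ordinary article path, while an unlisted crawler is governed by the fallback rules only.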

Other Records

Field Value
sitemap https://matthewzhaocc.com/sitemap/sitemap.xml

Warnings

  • `license` is not a known field.