blog.dataengineerthings.org
robots.txt

Robots Exclusion Standard data for blog.dataengineerthings.org

Resource Scan

Scan Details

Site Domain blog.dataengineerthings.org
Base Domain dataengineerthings.org
Scan Status Ok
Last Scan2025-09-24T08:29:22+00:00
Next Scan 2025-10-08T08:29:22+00:00

Last Scan

Scanned2025-09-24T08:29:22+00:00
URL https://blog.dataengineerthings.org/robots.txt
Redirect https://blog.dataengineerthings.org/robots.txt?gi=40ace99266a3
Domain IPs 162.159.152.4, 162.159.153.4
Response IP 162.159.153.4
Found Yes
Hash eba5f5934ac65161ef8813e9e4b14ca7ca20ea61d17efb7af4b1917d06608261
SimHash 691cbb40c772

Groups

*

Rule Path
Disallow /m/
Disallow /me/
Disallow /%40me$
Disallow /%40me/
Disallow /*/edit$
Disallow /*/*/edit$
Disallow /media/
Disallow /p/*/share
Disallow /r/
Disallow /trending
Disallow /search?q$
Disallow /search?q=
Disallow /*/search?q=
Disallow /*/search/*?q=
Disallow /*/*source%3D
Allow /_/api/users/*/meta
Allow /_/api/users/*/profile/stream
Allow /_/api/posts/*/responses
Allow /_/api/posts/*/responsesStream
Allow /_/api/posts/*/related

amazonbot
applebot-extended
bytespider
claudebot
facebookbot
googleother
gptbot
meta-externalagent

Rule Path
Disallow /
Allow /about
Allow /business
Allow /earn
Allow /gift
Allow /membership
Allow /partner-program
Allow /verified-authors

Other Records

Field Value
sitemap https://blog.dataengineerthings.org/sitemap/sitemap.xml

Warnings

  • `license` is not a known field.