Robots Exclusion Standard data for thebalance.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thebalance.com |
Base Domain | thebalance.com |
Scan Status | Ok |
Last Scan | 2024-11-09T20:18:08+00:00 |
Next Scan | 2024-11-16T20:18:08+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-09T20:18:08+00:00 |
URL | https://thebalance.com/robots.txt |
Redirect | https://www.thebalancemoney.com:443/robots.txt |
Redirect Domain | www.thebalancemoney.com |
Redirect Base | thebalancemoney.com |
Domain IPs | 18.215.127.208, 3.212.97.252, 34.239.50.125 |
Redirect IPs | 151.101.130.137, 151.101.194.137, 151.101.2.137, 151.101.66.137 |
Response IP | 199.232.46.137 |
Found | Yes |
Hash | 32f98a844ba99415d78913ac7e3f7909a75c629c0db0a63d006834a1c4ea3674 |
SimHash | 6d145871a9f3 |
Groups
*
Rule | Path |
---|---|
Disallow | *quizResult%3D |
Disallow | |
Disallow | *globeTest_ |
Disallow | *globeNoTest |
Disallow | *globeResource |
Disallow | *?kw |
Disallow | /embed? |
Disallow | /shop/ |
Disallow | /authentication/ |
Disallow | /newsletters/preferences/manage |
Disallow | /newsletters/preferences/unsubscribe |
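
Several of the paths in this group use `*` as a wildcard, so a plain prefix comparison is not enough to evaluate them. The following is a minimal Python sketch of one way to test a URL path against the rules listed above. It assumes the wildcard semantics described in RFC 9309: `*` matches any run of characters, and a trailing `$` anchors the end of the path. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and not part of the scan data. The sketch also skips percent-encoding normalization and Allow-rule precedence, since this group contains no Allow rules.

```python
import re

# Disallow paths copied from the "*" group in the table above.
# The row with an empty path is left out: an empty Disallow value
# restricts nothing, so it cannot change the result.
DISALLOW = [
    "*quizResult%3D",
    "*globeTest_",
    "*globeNoTest",
    "*globeResource",
    "*?kw",
    "/embed?",
    "/shop/",
    "/authentication/",
    "/newsletters/preferences/manage",
    "/newsletters/preferences/unsubscribe",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, patterns=DISALLOW):
    """Return True if any non-empty pattern matches from the start of path."""
    # re.match anchors at the beginning, which mirrors prefix matching.
    return any(pattern_to_regex(p).match(path) for p in patterns if p)
```

Under these assumptions, `is_disallowed("/shop/item")` is true while `is_disallowed("/shopping")` is false, because `/shop/` only matches paths that begin with that full segment.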
Other Records
Field | Value |
---|---|
sitemap | https://www.thebalancemoney.com/sitemap.xml |
sitemap | https://www.thebalancemoney.com/google-news-sitemap.xml |
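
The split between Groups and Other Records above can be reproduced by a small parser. The sketch below is one possible approach in Python, not the scanner's actual implementation. `ROBOTS_TXT` is an abbreviated reconstruction from the tables in this report (the original file's exact text, ordering, and comments are not known), and `parse_robots` is a hypothetical helper name.

```python
# Abbreviated reconstruction from the tables in this report. This is an
# assumption for illustration, not the verbatim file served by the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: *quizResult%3D
Disallow:
Disallow: /shop/

Sitemap: https://www.thebalancemoney.com/sitemap.xml
Sitemap: https://www.thebalancemoney.com/google-news-sitemap.xml
"""


def parse_robots(text):
    """Split robots.txt text into per-agent rule groups and other records."""
    groups = {}       # user-agent -> list of (rule, path)
    other = []        # list of (field, value), e.g. sitemap entries
    agents = []       # user-agents that own the group being read
    in_rules = False  # True once the current group has at least one rule
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        # Split on the first colon only, so URL values stay intact.
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a user-agent line after rules starts a new group
                agents, in_rules = [], False
            agents.append(value)
            groups.setdefault(value, [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in agents:
                groups[agent].append((field.capitalize(), value))
        else:
            other.append((field, value))
    return groups, other


groups, other = parse_robots(ROBOTS_TXT)
```

Here `groups["*"]` holds the Disallow rows, including the one with an empty path, and `other` holds the two `sitemap` records, matching the layout of the two tables.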