sfgate.com
robots.txt
Robots Exclusion Standard data for sfgate.com
Resource Scan
Scan Details
Site Domain | sfgate.com |
Base Domain | sfgate.com |
Scan Status | Ok |
Last Scan | 2024-04-24T07:40:07+00:00 |
Next Scan | 2024-05-01T07:40:07+00:00 |
Last Scan
Scanned | 2024-04-24T07:40:07+00:00 |
URL | https://sfgate.com/robots.txt |
Redirect | https://www.sfgate.com/robots.txt |
Redirect Domain | www.sfgate.com |
Redirect Base | sfgate.com |
Domain IPs | 98.129.228.59 |
Redirect IPs | 151.101.0.200, 151.101.128.200, 151.101.192.200, 151.101.64.200 |
Response IP | 199.232.44.200 |
Found | Yes |
Hash | e7c339ee10b61b92d68b26558255ef792da8f7f25a5b234635209bbf39c26c90 |
SimHash | e1bb4c46aa53 |
Groups
*
Rule | Path |
---|---|
Disallow | /style/beauty/hearstmagazines/ |
Disallow | /style/fashion/hearstmagazines/ |
Disallow | /living/relationships/hearstmagazines/ |
Disallow | /homeandgarden/home/hearstmagazines/ |
Disallow | /living/wellness/hearstmagazines/ |
Disallow | /sponsored |
Disallow | /sponsoredarticles/ |
Disallow | /business/press-releases/ |
Disallow | /events/ |
Disallow | /movies/templates/listings |
Disallow | /food/dbapps/restaurants |
Disallow | /sso/action/logout |
Disallow | /general/dbapps/404 |
Disallow | /coupons/visit |
Disallow | /search |
googlebot-news
Rule | Path |
---|---|
Disallow | /business/prweb/ |
Disallow | /horoscopes/ |
Disallow | /entertainment/article/Minerva-s-horoscope |
*
Rule | Path |
---|---|
Disallow | /413gkwMT/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.sfgate.com/sitemap.xml |
sitemap | https://www.sfgate.com/sitemap_news.xml |