hooc.heraldcorp.com
robots.txt
Robots Exclusion Standard data for hooc.heraldcorp.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | hooc.heraldcorp.com |
Base Domain | heraldcorp.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Request timed out. |
Last Scan | 2024-05-25T02:13:17+00:00 |
Next Scan | 2024-06-24T02:13:17+00:00 |
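
The Last Scan and Next Scan values are ISO 8601 timestamps with a UTC offset. As a small sketch using only Python's standard library, the gap between them can be computed as follows. The variable names are illustrative and not part of the scan data, and the result describes only this pair of timestamps, not a documented scanner schedule.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (UTC offset included).
last_scan = datetime.fromisoformat("2024-05-25T02:13:17+00:00")
next_scan = datetime.fromisoformat("2024-06-24T02:13:17+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```
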
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-04-03T01:59:14+00:00 |
URL | http://hooc.heraldcorp.com/robots.txt |
Redirect | http://biz.heraldcorp.com/robots.txt |
Redirect Domain | biz.heraldcorp.com |
Redirect Base | heraldcorp.com |
Domain IPs | 110.93.143.160 |
Redirect IPs | 110.93.135.40 |
Response IP | 110.93.135.40 |
Found | Yes |
Hash | f04c1cd9c98eb7233b2abd57f2be06210a003510aa60083d0ca6c7b639798edd |
SimHash | 6b8609035cf3 |
Groups
googlebot
googlebot-news
googlebot-image
bingbot
msnbot
msnbot-media
bingpreview
facebot
twitterbot
popin_agent
yeti
google search console
googlebot/2.1
googlebot-smartphone
Rule | Path |
---|---|
Disallow | /news/ |
Disallow | /realty/ |
Disallow | /wealth/ |
Disallow | /opinien/ |
Disallow | /life/ |
Disallow | /sports/ |
Disallow | /subsc/ |
Disallow | /policy/ |
Disallow | /mypage/ |
Disallow | /paoin_heraldbiz/ |
Disallow | /search/ |
Disallow | /clean/ |
Disallow | /global_insite/ |
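
To see how these rules would affect a crawler, the sketch below rebuilds a robots.txt body from the user agents and paths listed above and checks a few URLs with Python's standard `urllib.robotparser`. It assumes the single rule table applies to every listed user agent as one group; the original file's exact layout is not shown in this report. The helper `build_robots_txt`, the sample URLs, and the agent name `examplecrawler` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User agents and paths copied from the Groups section of this report.
USER_AGENTS = [
    "googlebot", "googlebot-news", "googlebot-image", "bingbot", "msnbot",
    "msnbot-media", "bingpreview", "facebot", "twitterbot", "popin_agent",
    "yeti", "google search console", "googlebot/2.1", "googlebot-smartphone",
]
DISALLOWED_PATHS = [
    "/news/", "/realty/", "/wealth/", "/opinien/", "/life/", "/sports/",
    "/subsc/", "/policy/", "/mypage/", "/paoin_heraldbiz/", "/search/",
    "/clean/", "/global_insite/",
]


def build_robots_txt(agents, paths):
    """Hypothetical helper: emit one group with all agents, then all rules."""
    lines = [f"User-agent: {agent}" for agent in agents]
    lines += [f"Disallow: {path}" for path in paths]
    return "\n".join(lines) + "\n"


# Assumption: one shared group; the real file may repeat rules per agent.
parser = RobotFileParser()
parser.parse(build_robots_txt(USER_AGENTS, DISALLOWED_PATHS).splitlines())

# Sample URLs are illustrative; only the path prefix matters for matching.
news_blocked = not parser.can_fetch("Googlebot", "http://biz.heraldcorp.com/news/example")
root_allowed = parser.can_fetch("Googlebot", "http://biz.heraldcorp.com/")
# An agent not named in any group, with no "*" group present, is allowed.
other_allowed = parser.can_fetch("examplecrawler", "http://biz.heraldcorp.com/news/example")

print(news_blocked, root_allowed, other_allowed)
```

Note that `urllib.robotparser` matches rules by simple path prefix and matches user agents by case-insensitive substring, so other crawlers' own parsers may interpret the same file slightly differently.
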
Other Records
Field | Value |
---|---|
sitemap | http://biz.heraldcorp.com/sitemap_section.xml |