biz.heraldcorp.com
robots.txt
Robots Exclusion Standard data for biz.heraldcorp.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | biz.heraldcorp.com |
Base Domain | heraldcorp.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Request timed out. |
Last Scan | 2024-11-13T08:23:12+00:00 |
Next Scan | 2024-12-13T08:23:12+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-09-22T08:21:54+00:00 |
URL | https://biz.heraldcorp.com/robots.txt |
Domain IPs | 110.93.135.40 |
Response IP | 110.93.135.40 |
Found | Yes |
Hash | f04c1cd9c98eb7233b2abd57f2be06210a003510aa60083d0ca6c7b639798edd |
SimHash | 6b8609035cf3 |
Groups
googlebot
googlebot-news
googlebot-image
bingbot
msnbot
msnbot-media
bingpreview
facebot
twitterbot
popin_agent
yeti
google search console
googlebot/2.1
googlebot-smartphone
Rule | Path |
---|---|
Disallow | /news/ |
Disallow | /realty/ |
Disallow | /wealth/ |
Disallow | /opinien/ |
Disallow | /life/ |
Disallow | /sports/ |
Disallow | /subsc/ |
Disallow | /policy/ |
Disallow | /mypage/ |
Disallow | /paoin_heraldbiz/ |
Disallow | /search/ |
Disallow | /clean/ |
Disallow | /global_insite/ |
Other Records
Field | Value |
---|---|
sitemap | http://biz.heraldcorp.com/sitemap_section.xml |
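The group and sitemap record reported above can be checked locally with Python's standard `urllib.robotparser`. The following is a minimal sketch, not a copy of the site's actual file: it rebuilds a robots.txt from the user agents, paths, and sitemap listed in this report, and assumes that this single group is the only one in the file. The helper `build_robots_txt` and the agent name `examplebot` are hypothetical, introduced only for illustration.

```python
from urllib.robotparser import RobotFileParser

# User agents, paths, and sitemap copied from the scan report above.
USER_AGENTS = [
    "googlebot", "googlebot-news", "googlebot-image", "bingbot", "msnbot",
    "msnbot-media", "bingpreview", "facebot", "twitterbot", "popin_agent",
    "yeti", "google search console", "googlebot/2.1", "googlebot-smartphone",
]
DISALLOWED = [
    "/news/", "/realty/", "/wealth/", "/opinien/", "/life/", "/sports/",
    "/subsc/", "/policy/", "/mypage/", "/paoin_heraldbiz/", "/search/",
    "/clean/", "/global_insite/",
]
SITEMAP = "http://biz.heraldcorp.com/sitemap_section.xml"
BASE = "https://biz.heraldcorp.com"


def build_robots_txt() -> str:
    """Hypothetical helper: rebuild one group plus the sitemap record.

    Assumption: the real file contains no other groups or rules.
    """
    lines = [f"User-agent: {agent}" for agent in USER_AGENTS]
    lines += [f"Disallow: {path}" for path in DISALLOWED]
    lines += ["", f"Sitemap: {SITEMAP}"]
    return "\n".join(lines)


# Parse the reconstructed text directly; no network request is made.
parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

# A listed agent requesting a disallowed prefix.
print(parser.can_fetch("googlebot", BASE + "/news/"))
# A listed agent requesting a path that matches no Disallow rule.
print(parser.can_fetch("googlebot", BASE + "/"))
# An unlisted (hypothetical) agent: no group applies under the assumption above.
print(parser.can_fetch("examplebot", BASE + "/news/"))
# Sitemap records found while parsing (site_maps() requires Python 3.8+).
print(parser.site_maps())
```

Under these assumptions the first check reports that the fetch is not allowed, while the second and third are allowed, because no rule matches the path or the agent. Parsing the reconstructed text rather than fetching the live URL also sidesteps the timeout recorded in the most recent scan.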