mhooc.heraldcorp.com
robots.txt

Robots Exclusion Standard data for mhooc.heraldcorp.com

Resource Scan

Scan Details

Site Domain mhooc.heraldcorp.com
Base Domain heraldcorp.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2024-06-04T22:29:47+00:00
Next Scan 2024-06-11T22:29:47+00:00

Last Successful Scan

Scanned2024-05-04T22:28:58+00:00
URL http://mhooc.heraldcorp.com/robots.txt
Redirect http://biz.heraldcorp.com/robots.txt
Redirect Domain biz.heraldcorp.com
Redirect Base heraldcorp.com
Redirect IPs 110.93.135.40
Response IP 110.93.135.40
Found Yes
Hash f04c1cd9c98eb7233b2abd57f2be06210a003510aa60083d0ca6c7b639798edd
SimHash 6b8609035cf3

Groups

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

bingbot

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /

bingpreview

Rule Path
Allow /

facebot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

popin_agent

Rule Path
Allow /

yeti

Rule Path
Allow /

google search console

Rule Path
Allow /

googlebot/2.1

Rule Path
Allow /

googlebot-smartphone

Rule Path
Allow /

googlebot
googlebot-news
googlebot-image
bingbot
msnbot
msnbot-media
bingpreview
facebot
twitterbot
popin_agent
yeti
google search console
googlebot/2.1
googlebot-smartphone

Rule Path
Disallow /news/
Disallow /realty/
Disallow /wealth/
Disallow /opinien/
Disallow /life/
Disallow /sports/
Disallow /subsc/
Disallow /policy/
Disallow /mypage/
Disallow /paoin_heraldbiz/
Disallow /search/
Disallow /clean/
Disallow /global_insite/

Other Records

Field Value
sitemap http://biz.heraldcorp.com/sitemap_section.xml