how-ma.com
robots.txt

Robots Exclusion Standard data for how-ma.com

Resource Scan

Scan Details

Site Domain how-ma.com
Base Domain how-ma.com
Scan Status Ok
Last Scan2025-10-12T15:41:34+00:00
Next Scan 2025-11-11T15:41:34+00:00

Last Scan

Scanned2025-10-12T15:41:34+00:00
URL https://how-ma.com/robots.txt
Redirect https://www.how-ma.com/robots.txt
Redirect Domain www.how-ma.com
Redirect Base how-ma.com
Domain IPs 104.26.2.51, 104.26.3.51, 172.67.74.93, 2606:4700:20::681a:233, 2606:4700:20::681a:333, 2606:4700:20::ac43:4a5d
Redirect IPs 104.26.2.51, 104.26.3.51, 172.67.74.93, 2606:4700:20::681a:233, 2606:4700:20::681a:333, 2606:4700:20::ac43:4a5d
Response IP 104.26.2.51
Found Yes
Hash ae23454f70fa965b3083aaf835effa01b85a6aa4c179f31a5fb29ff884148a92
SimHash 801605c77e62

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /users/sign_up/mansion*?
Disallow /users/sign_up/kodate*?
Disallow /users/sign_up/tochi*?
Disallow /users/ba_sign_up/standard/condo*?
Disallow /users/ba_sign_up/standard/house*?
Disallow /users/ba_sign_up/standard/land*?

ahrefsbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

daum

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mb2345browser

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

mqqbrowser

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

zh-cn

Rule Path
Disallow /

zh_cn

Rule Path
Disallow /

kinza

Rule Path
Disallow /

micromessenger

Rule Path
Disallow /

yeti

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

cincraw

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.how-ma.com/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file

Warnings

  • 2 invalid lines.