gvm.com.tw
robots.txt

Robots Exclusion Standard data for gvm.com.tw

Resource Scan

Scan Details

Site Domain gvm.com.tw
Base Domain gvm.com.tw
Scan Status Ok
Last Scan 2024-05-15T00:03:04+00:00
Next Scan 2024-05-22T00:03:04+00:00

Last Scan

Scanned 2024-05-15T00:03:04+00:00
URL https://www.gvm.com.tw/robots.txt
Domain IPs 108.157.254.17, 108.157.254.56, 108.157.254.57, 108.157.254.86, 2600:9000:2753:1200:8:dd5d:1f80:93a1, 2600:9000:2753:1e00:8:dd5d:1f80:93a1, 2600:9000:2753:5a00:8:dd5d:1f80:93a1, 2600:9000:2753:7800:8:dd5d:1f80:93a1, 2600:9000:2753:7e00:8:dd5d:1f80:93a1, 2600:9000:2753:9a00:8:dd5d:1f80:93a1, 2600:9000:2753:bc00:8:dd5d:1f80:93a1, 2600:9000:2753:ee00:8:dd5d:1f80:93a1
Response IP 108.157.254.57
Found Yes
Hash 85a1c22e090f1681b49bfd8b1683b52a8b386beb7dba5a5916a17a307bb7a42d
SimHash d81444798c90

Groups

*

Rule Path
Disallow /public
Disallow /mini
Disallow /error_404
Disallow /*article/*?article_further_1_1
Disallow /*article/*?article_further_1_2
Disallow /*article/*?article_further_1_3
Disallow /*article/*?article_further_1_4
Disallow /*article/*?fbclid*
Disallow /*article/*?tags
Disallow /*article/*?article_also_1
Disallow /*article/*?article_also_2
Disallow /*article/*?article_also_3
Disallow /*article/*?blog
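
The wildcard rules in the `*` group can be easier to read with a small matcher. The following is a minimal sketch, assuming the common robots.txt convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The function names and the sample paths are hypothetical and are not part of the scan data. Allow rules and longest-match precedence are deliberately left out.

```python
import re

def robots_pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing '$' anchors the pattern to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Each '*' becomes '.*'; every other character is matched literally.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path, disallow_patterns):
    """Return True if any Disallow pattern matches the path from its start."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(robots_pattern_to_regex(p).match(path) for p in disallow_patterns)

# A subset of the Disallow rules listed for the '*' group above.
RULES = [
    "/public",
    "/mini",
    "/error_404",
    "/*article/*?fbclid*",
    "/*article/*?tags",
]

# Hypothetical paths, used only for illustration.
print(is_disallowed("/article/12345?fbclid=abc", RULES))
print(is_disallowed("/article/12345", RULES))
print(is_disallowed("/public/logo.png", RULES))
```

Under these assumptions, the first and third paths are blocked and the second is not. Because `/public` is a prefix rule, it also blocks any path that merely begins with those characters.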

twitterbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /
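
How a crawler picks among the groups above can be sketched as follows. This assumes the usual convention that a crawler compares its product token case-insensitively against the group names and falls back to the `*` group when none matches. The function names are hypothetical, and `Googlebot` is used only as an example of a token that does not appear in this scan.

```python
# Agents that the scan lists with a single "Disallow /" rule.
FULLY_BLOCKED = {
    "twitterbot", "ahrefsbot", "semrushbot", "chatgpt-user",
    "google-extended", "ccbot", "bytespider", "gptbot", "ia_archiver",
    "omgili", "omgilibot", "anthropic-ai", "cohere-ai", "mj12bot",
    "piplbot", "claude-web", "turnitinbot", "petalbot", "moodlebot",
    "magpie-crawler", "dataforseobot",
}

def select_group(product_token):
    """Return the name of the group that applies to this crawler token."""
    token = product_token.lower()
    # A named group takes priority; otherwise the wildcard group applies.
    return token if token in FULLY_BLOCKED else "*"

def is_fully_blocked(product_token):
    """True if the crawler falls into a group whose only rule is 'Disallow /'."""
    return select_group(product_token) != "*"

print(select_group("GPTBot"))
print(select_group("Googlebot"))
print(is_fully_blocked("CCBot"))
```

A token that matches one of the named groups is denied the whole site, while any other token is subject only to the path rules in the `*` group.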

Other Records

Field Value
sitemap https://www.gvm.com.tw/sitemap