gq.com.tw
robots.txt
Robots Exclusion Standard data for gq.com.tw
Resource Scan
Scan Details
Site Domain | gq.com.tw |
Base Domain | gq.com.tw |
Scan Status | Ok |
Last Scan | 2024-06-20T06:34:42+00:00 |
Next Scan | 2024-06-27T06:34:42+00:00 |
Last Scan
Scanned | 2024-06-20T06:34:42+00:00 |
URL | https://gq.com.tw/robots.txt |
Redirect | https://www.gq.com.tw/robots.txt |
Redirect Domain | www.gq.com.tw |
Redirect Base | gq.com.tw |
Domain IPs | 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133 |
Redirect IPs | 18.155.68.121, 18.155.68.17, 18.155.68.3, 18.155.68.72, 2600:9000:2816:1000:f:37c0:ca00:93a1, 2600:9000:2816:1800:f:37c0:ca00:93a1, 2600:9000:2816:3e00:f:37c0:ca00:93a1, 2600:9000:2816:7e00:f:37c0:ca00:93a1, 2600:9000:2816:800:f:37c0:ca00:93a1, 2600:9000:2816:9200:f:37c0:ca00:93a1, 2600:9000:2816:cc00:f:37c0:ca00:93a1, 2600:9000:2816:f400:f:37c0:ca00:93a1 |
Response IP | 18.155.68.121 |
Found | Yes |
Hash | 303a3aba2f2fd97d06507a2923fef0231e677ab18a81370e56f40f5d68f0c286 |
SimHash | 4404c9618731 |
Groups
*
Rule | Path |
---|---|
Disallow | *?image= |
Disallow | */image/ |
Disallow | /*%7B%7Burl%7D%7D |
Disallow | /*%7B%7Burl%7D%7D |
Disallow | /*?redirectURL= |
Disallow | /preview/ |
Disallow | /*%7B%7Bsection%7D%7D |
Disallow | /*%7B%7Bsection%7D%7D |
Disallow | /auth/* |
Disallow | /account/* |
Disallow | */blog/ |
Disallow | /product/ |
Disallow | /user-context?referrer |
Disallow | /services.min.js?_= |
Other Records
Field | Value |
---|---|
sitemap | https://www.gq.com.tw/sitemap.xml |
sitemap | https://www.gq.com.tw/feed/sitemap/sitemap-google-news |
sitemap | https://www.gq.com.tw/feed/rss |
sitemap | https://www.gq.com.tw/categories-sitemap.xml |
sitemap | https://www.gq.com.tw/contributors-sitemap.xml |
sitemap | https://www.gq.com.tw/branded-sitemap.xml |
sitemap | https://www.gq.com.tw/bundles-sitemap.xml |