news.mixi.jp
robots.txt
Robots Exclusion Standard data for news.mixi.jp
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | news.mixi.jp |
Base Domain | mixi.jp |
Scan Status | Ok |
Last Scan | 2024-06-02T08:21:15+00:00 |
Next Scan | 2024-07-02T08:21:15+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-02T08:21:15+00:00 |
URL | https://news.mixi.jp/robots.txt |
Domain IPs | 18.180.72.219, 54.238.227.135 |
Response IP | 54.238.227.135 |
Found | Yes |
Hash | 56991243e1d992976e4b3b08d79f1fcb30fa63fa2a9279d266674c4f20187fa0 |
SimHash | 20440ac10499 |
Groups
*
Rule | Path |
---|---|
Disallow | / |
Allow | /$ |
Allow | /ads.txt |
Allow | /static/ |
Allow | /list_news.pl |
Allow | /list_news_category.pl |
Allow | /list_news_category_touch.pl |
Allow | /list_news_media.pl |
Allow | /list_news_topics.pl |
Allow | /list_quote.pl |
Allow | /list_special_topics_article.pl |
Allow | /search_news.pl |
Allow | /show_media.pl |
Allow | /show_ranking.pl |
Allow | /sitemap.xml |
Allow | /view_news.pl |
Allow | /view_special_topics.pl |
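
The group above blocks everything with `Disallow: /` and then re-opens specific paths. The sketch below illustrates how a crawler that follows the longest-match rule of RFC 9309 might resolve these rules. It is a minimal illustration, not the scanner's or any crawler's actual implementation: the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the test paths (such as `/home.pl`) are made up for the example.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("disallow", "/"),
    ("allow", "/$"),
    ("allow", "/ads.txt"),
    ("allow", "/static/"),
    ("allow", "/list_news.pl"),
    ("allow", "/list_news_category.pl"),
    ("allow", "/list_news_category_touch.pl"),
    ("allow", "/list_news_media.pl"),
    ("allow", "/list_news_topics.pl"),
    ("allow", "/list_quote.pl"),
    ("allow", "/list_special_topics_article.pl"),
    ("allow", "/search_news.pl"),
    ("allow", "/show_media.pl"),
    ("allow", "/show_ranking.pl"),
    ("allow", "/sitemap.xml"),
    ("allow", "/view_news.pl"),
    ("allow", "/view_special_topics.pl"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = path.endswith("$")
    body = path[:-1] if anchored else path
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be crawled under longest-match rules."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            # The longest pattern wins; on a tie, allow is preferred.
            if len(path) > best_len or (
                len(path) == best_len and allow and not best_allow
            ):
                best_len, best_allow = len(path), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/view_news.pl?id=1", "/static/css/a.css", "/home.pl"]:
        print(p, is_allowed(p))
```

Under these assumptions, the bare root `/` is allowed by `/$`, the listed scripts and `/static/` are allowed because their patterns are longer than `/`, and any other path falls back to the blanket `Disallow: /`.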
Other Records
Field | Value |
---|---|
sitemap | http://news.mixi.jp/sitemap.xml |