news.mixi.jp
robots.txt

Robots Exclusion Standard data for news.mixi.jp

Resource Scan

Scan Details

Site Domain news.mixi.jp
Base Domain mixi.jp
Scan Status Ok
Last Scan2024-06-02T08:21:15+00:00
Next Scan 2024-07-02T08:21:15+00:00

Last Scan

Scanned2024-06-02T08:21:15+00:00
URL https://news.mixi.jp/robots.txt
Domain IPs 18.180.72.219, 54.238.227.135
Response IP 54.238.227.135
Found Yes
Hash 56991243e1d992976e4b3b08d79f1fcb30fa63fa2a9279d266674c4f20187fa0
SimHash 20440ac10499

Groups

*

Rule Path
Disallow /
Allow /$
Allow /ads.txt
Allow /static/
Allow /list_news.pl
Allow /list_news_category.pl
Allow /list_news_category_touch.pl
Allow /list_news_media.pl
Allow /list_news_topics.pl
Allow /list_quote.pl
Allow /list_special_topics_article.pl
Allow /search_news.pl
Allow /show_media.pl
Allow /show_ranking.pl
Allow /sitemap.xml
Allow /view_news.pl
Allow /view_special_topics.pl

mediapartners-google

Rule Path
Allow /

Other Records

Field Value
sitemap http://news.mixi.jp/sitemap.xml