pixiv.net
robots.txt

Robots Exclusion Standard data for pixiv.net

Resource Scan

Scan Details

Site Domain pixiv.net
Base Domain pixiv.net
Scan Status Ok
Last Scan2024-04-27T20:08:07+00:00
Next Scan 2024-05-04T20:08:07+00:00

Last Scan

Scanned2024-04-27T20:08:07+00:00
URL https://pixiv.net/robots.txt
Redirect https://www.pixiv.net/robots.txt
Redirect Domain www.pixiv.net
Redirect Base pixiv.net
Domain IPs 210.140.92.181, 210.140.92.183, 210.140.92.187
Redirect IPs 104.18.42.239, 172.64.145.17
Response IP 172.64.145.17
Found Yes
Hash bc39dddb8f97c94c9423f4e01903e214a9611e8e420bfb4fca7874122a569689
SimHash 912a4e0685f1

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /rpc/index.php?mode=profile_module_illusts&user_id=*&illust_id=*
Disallow /ajax/illust/*/recommend/init
Disallow *return_to*
Disallow /?return_to=
Disallow /login.php?return_to=
Disallow /index.php?return_to=
Disallow /artworks/unlisted/*
Disallow /users/*/followers
Disallow /users/*/mypixiv
Disallow /users/*/bookmarks
Disallow /novel/comments.php?id=
Disallow /novels/unlisted/*
Disallow /en/group
Disallow /en/search/
Disallow /en/users/*/followers
Disallow /en/users/*/mypixiv
Disallow /en/users/*/bookmarks
Disallow /en/novel/comments.php?id=
Disallow /fanbox/search
Disallow /fanbox/tag
Allow /comic-indies/$
Allow /comic-indies/about
Disallow /comic-indies/