novel.prcm.jp
robots.txt

Robots Exclusion Standard data for novel.prcm.jp

Resource Scan

Scan Details

Site Domain novel.prcm.jp
Base Domain prcm.jp
Scan Status Ok
Last Scan 2024-09-17T09:10:31+00:00
Next Scan 2024-09-24T09:10:31+00:00

Last Scan

Scanned 2024-09-17T09:10:31+00:00
URL https://novel.prcm.jp/robots.txt
Domain IPs 2600:1901:0:c4c5::, 35.241.13.68
Response IP 35.241.13.68
Found Yes
Hash 8f8747b826b02343217ffaad9af7317f544f373f97cda6d9544af9ca48e8f6c1
SimHash 60565004c8b2
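
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The following is a minimal sketch of how such a fingerprint could be computed, assuming SHA-256 over the raw response body. The function name `robots_fingerprint` and the sample body are hypothetical and are not the real file contents.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scanner's actual
    algorithm and any normalization it applies are not stated in the report.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file contents.
sample = b"User-agent: *\nDisallow: /mine/*\n"
digest = robots_fingerprint(sample)
print(len(digest), digest[:12])
```

Comparing the digest from one scan with the digest from the next is a cheap way to detect that the file changed at all; a similarity hash such as the SimHash field would be needed to judge how much it changed.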

Groups

ttd-content

Rule Path
Disallow /

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

applebot

Rule Path
Disallow /novel/
Disallow /user/
Disallow /api/

Other Records

Field Value
crawl-delay 10
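
The applebot group uses plain path prefixes, so it can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds the group as robots.txt text; the exact casing and line order of the real file are assumptions, as is the helper name `applebot_allowed` and the example path.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the applebot group above; the real file's exact
# casing and line order are assumptions.
APPLEBOT_GROUP = """\
User-agent: applebot
Disallow: /novel/
Disallow: /user/
Disallow: /api/
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(APPLEBOT_GROUP.splitlines())


def applebot_allowed(path: str) -> bool:
    """Check a site-relative path against the applebot group."""
    return parser.can_fetch("applebot", "https://novel.prcm.jp" + path)


print(applebot_allowed("/novel/example"))  # hypothetical path under /novel/
print(applebot_allowed("/"))
print(parser.crawl_delay("applebot"))
```

Note that `urllib.robotparser` matches rules by prefix only and does not interpret `*` inside a path, so it is not suitable as-is for the wildcard rules in the `*` group.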

*

Rule Path
Disallow /user/*/followers
Disallow /user/*/following
Disallow /novel/*/favorite-users
Disallow /novel/*/spotlight-contributors
Disallow /novel/*/chapter/*/like-users
Disallow /novel/*/chapter/*/replace-setting
Disallow /novel/*/chapter/*/comment/list
Disallow /novel/*/chapter/*/edit
Disallow /mine/*
Disallow /book-shelf/*
Disallow /enhance-signing-up/*

Other Records

Field Value
crawl-delay 10
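
The rules in the `*` group rely on `*` wildcards inside the path. A rough sketch of how a crawler might evaluate them follows, assuming RFC 9309 semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path). The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch handles Disallow rules only, which is all this group contains.

```python
import re


def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumes RFC 9309 semantics: '*' matches any run of characters and a
    trailing '$' anchors the end of the path; everything else is literal.
    """
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    body = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules: list[str]) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# A few of the '*' group's rules from the listing above.
RULES = ["/user/*/followers", "/novel/*/chapter/*/edit", "/mine/*"]

print(is_disallowed("/user/42/followers", RULES))
print(is_disallowed("/novel/7/chapter/3", RULES))
```

Because `re.match` anchors only at the start of the string, each pattern behaves as a prefix match, which is why `/mine/*` covers everything under `/mine/` but not `/mine` itself.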

Other Records

Field Value
sitemap https://novel.prcm.jp/sitemap/sitemap-index.xml