s-space.snu.ac.kr
robots.txt

Robots Exclusion Standard data for s-space.snu.ac.kr

Resource Scan

Scan Details

Site Domain s-space.snu.ac.kr
Base Domain snu.ac.kr
Scan Status Ok
Last Scan 2024-11-03T10:45:52+00:00
Next Scan 2024-12-03T10:45:52+00:00

Last Scan

Scanned 2024-11-03T10:45:52+00:00
URL https://s-space.snu.ac.kr/robots.txt
Domain IPs 147.46.181.99
Response IP 147.46.181.99
Found Yes
Hash 51d13630cb646bb9b5032bdb5bbbfa69f9d0296d35471ae14092d8340e5bde26
SimHash a9d95980ceb7

Groups

*

Rule Path
Disallow /discover
Disallow /simple-search
Disallow /open-search
Disallow /export-excel
Disallow /export-dc
Disallow /export-ris
Disallow /export
Disallow /json

petalbot

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

fast

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /
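
The groups above can be checked programmatically. The following is a minimal sketch, assuming Python's standard urllib.robotparser and a partial reconstruction of the file (the "*" group plus two of the fully blocked agents). The exact layout of the original robots.txt, the agent name "ExampleBot", and the path "/handle/10371/1" are illustrative assumptions, not data from the scan.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the scanned rules: the "*" group plus two of the
# fully blocked agents. The original file's exact layout is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /discover
Disallow: /simple-search
Disallow: /open-search
Disallow: /export-excel
Disallow: /export-dc
Disallow: /export-ris
Disallow: /export
Disallow: /json

User-agent: petalbot
Disallow: /

User-agent: httrack
Disallow: /
"""

BASE = "https://s-space.snu.ac.kr"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "ExampleBot" and "/handle/10371/1" are hypothetical, for illustration.
    checks = [
        ("ExampleBot", "/handle/10371/1"),
        ("ExampleBot", "/discover"),
        ("ExampleBot", "/export-ris"),
        ("PetalBot", "/handle/10371/1"),
        ("HTTrack", "/"),
    ]
    for agent, path in checks:
        print(f"{agent:12} {path:20} allowed={allowed(agent, path)}")
```

Under these rules, an agent without its own group falls back to the "*" group and is kept out of the search and export paths only, while an agent with its own group (such as petalbot) is blocked from the whole site. Note that this parser matches Disallow paths by prefix, so "Disallow /export" also covers "/export-excel", "/export-dc", and "/export-ris".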

Other Records

Field Value
sitemap https://s-space.snu.ac.kr/sitemap
sitemap https://s-space.snu.ac.kr/htmlmap
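
The sitemap records can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and assuming the records appear in the file as "Sitemap:" lines; the scan only reports the field name and value.

```python
from urllib.robotparser import RobotFileParser

# The two sitemap records from the scan, written as robots.txt lines.
# The "Sitemap:" capitalization is an assumption; the parser lowercases keys.
SITEMAP_LINES = [
    "Sitemap: https://s-space.snu.ac.kr/sitemap",
    "Sitemap: https://s-space.snu.ac.kr/htmlmap",
]

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES)

# site_maps() returns a list of URLs, or None when no sitemap is declared.
sitemaps = sitemap_parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```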