app.perusall.com
robots.txt

Robots Exclusion Standard data for app.perusall.com

Resource Scan

Scan Details

Site Domain app.perusall.com
Base Domain perusall.com
Scan Status Ok
Last Scan 2025-11-28T07:52:26+00:00
Next Scan 2025-12-28T07:52:26+00:00

Last Scan

Scanned 2025-11-28T07:52:26+00:00
URL https://app.perusall.com/robots.txt
Domain IPs 2600:9000:28c2:1c00:1f:6df1:2700:93a1, 2600:9000:28c2:6800:1f:6df1:2700:93a1, 2600:9000:28c2:6c00:1f:6df1:2700:93a1, 2600:9000:28c2:9000:1f:6df1:2700:93a1, 2600:9000:28c2:bc00:1f:6df1:2700:93a1, 2600:9000:28c2:c200:1f:6df1:2700:93a1, 2600:9000:28c2:d000:1f:6df1:2700:93a1, 2600:9000:28c2:e400:1f:6df1:2700:93a1, 3.171.198.16, 3.171.198.55, 3.171.198.70, 3.171.198.88
Response IP 3.171.198.70
Found Yes
Hash 285ddc8dad0a652e3ea127131de4ccedfba89cc6572d69a9bf907ad90c8a6227
SimHash a1c5b5c4ebf2

Groups

*

Rule Path
Disallow /welcome
Disallow /inactivity
Disallow /home
Disallow /404
Disallow /restricted
Disallow /purchase
Disallow /legal
Disallow /invitation
Disallow /book_club_invitation
Disallow /lti
Disallow /unsubscribe
Disallow /not_available
Disallow /r/
Disallow /courses
Disallow /join
Disallow /sample
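
The rule table above can be checked programmatically. The following is a minimal sketch, assuming the live file is equivalent to a single "User-agent: *" group containing exactly the Disallow paths listed in this scan (the real file may differ in ordering, comments, or extra directives). It rebuilds that text and evaluates it with Python's standard urllib.robotparser, which matches rules by simple path prefix. The names DISALLOWED, build_robots_txt, is_allowed, and the "ExampleBot" user agent are hypothetical and exist only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths as reported for the "*" group in the scan above.
DISALLOWED = [
    "/welcome", "/inactivity", "/home", "/404", "/restricted", "/purchase",
    "/legal", "/invitation", "/book_club_invitation", "/lti", "/unsubscribe",
    "/not_available", "/r/", "/courses", "/join", "/sample",
]

BASE = "https://app.perusall.com"


def build_robots_txt(paths):
    """Rebuild a robots.txt body with one wildcard group (assumed layout)."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in paths]
    return "\n".join(lines) + "\n"


# Parse the reconstructed text locally; nothing is fetched over the network.
parser = RobotFileParser()
parser.parse(build_robots_txt(DISALLOWED).splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch BASE + path under the listed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for path in ["/", "/courses/123", "/r", "/r/abc", "/homework"]:
        print(path, is_allowed(path))
```

Because matching is by prefix, "Disallow /home" also covers a path such as /homework, while "Disallow /r/" covers /r/abc but not /r itself. Other crawlers may implement matching differently, so this shows how one parser reads the rules rather than how every bot will behave.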