scholar.google.com
robots.txt
Robots Exclusion Standard data for scholar.google.com
Resource Scan
Scan Details
Site Domain | scholar.google.com |
Base Domain | google.com |
Scan Status | Ok |
Last Scan | 2024-04-24T02:35:06+00:00 |
Next Scan | 2024-05-24T02:35:06+00:00 |
Last Scan
Scanned | 2024-04-24T02:35:06+00:00 |
URL | https://scholar.google.com/robots.txt |
Domain IPs | 172.253.118.103, 172.253.118.104, 172.253.118.105, 172.253.118.106, 172.253.118.147, 172.253.118.99 |
Response IP | 74.125.130.105 |
Found | Yes |
Hash | c30f4b2fe5dcc4b51dc6c5e06bce15dc28b220c42e91f4c709d749cae4968385 |
SimHash | a927d4209ef1 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /index.html |
Disallow | /scholar |
Disallow | /citations? |
Allow | /citations?user= |
Disallow | /citations?*cstart= |
Disallow | /citations?user=*%40 |
Disallow | /citations?user=*%40 |
Allow | /citations?view_op=list_classic_articles |
Allow | /citations?view_op=metrics_intro |
Allow | /citations?view_op=new_profile |
Allow | /citations?view_op=sitemap |
Allow | /citations?view_op=top_venues |