forum.thegradcafe.com
robots.txt
Robots Exclusion Standard data for forum.thegradcafe.com
Resource Scan
Scan Details
Site Domain | forum.thegradcafe.com |
Base Domain | thegradcafe.com |
Scan Status | Ok |
Last Scan | 2024-11-16T15:24:32+00:00 |
Next Scan | 2024-11-23T15:24:32+00:00 |
Last Scan
Scanned | 2024-11-16T15:24:32+00:00 |
URL | https://forum.thegradcafe.com/robots.txt |
Domain IPs | 72.52.144.230 |
Response IP | 72.52.144.230 |
Found | Yes |
Hash | e3772ef5d70e40d4cffc754257201fd83668f1cf64d20cb495de0185731ab358 |
SimHash | 30306a83a4ba |
Groups
*
Rule | Path |
---|---|
Disallow | /startTopic/ |
Disallow | /discover/unread/ |
Disallow | /markallread/ |
Disallow | /staff/ |
Disallow | /cookies/ |
Disallow | /online/ |
Disallow | /discover/ |
Disallow | /leaderboard/ |
Disallow | /search/ |
Disallow | /tags/ |
Disallow | /*?advancedSearchForm= |
Disallow | /register/ |
Disallow | /lostpassword/ |
Disallow | /login/ |
Disallow | /*currency%3D |
Disallow | /*?sortby= |
Disallow | /*?filter= |
Disallow | /*?tab= |
Disallow | /*?do= |
Disallow | /*ref%3D |
Disallow | /*?forumId* |
Disallow | /*?&controller=embed |
Other Records
Field | Value |
---|---|
sitemap | https://forum.thegradcafe.com/sitemap.php |
Comments