directory5.org
robots.txt
Robots Exclusion Standard data for directory5.org
Resource Scan
Scan Details
Site Domain | directory5.org |
Base Domain | directory5.org |
Scan Status | Ok |
Last Scan | 2024-11-11T23:00:22+00:00 |
Next Scan | 2024-11-18T23:00:22+00:00 |
Last Scan
Scanned | 2024-11-11T23:00:22+00:00 |
URL | https://directory5.org/robots.txt |
Domain IPs | 23.239.109.118 |
Response IP | 23.239.109.118 |
Found | Yes |
Hash | 0f2fbe8b54cabd3577a681e3cf4006985312273dc943b090e6768ff13e60c703 |
SimHash | 45d0c9d3427d |
Groups
*
Rule | Path |
---|---|
Disallow | /*page |
Disallow | /*search |
Disallow | /*p%3D |
Disallow | /*gosearch.php |
Disallow | /*s%3D |
Disallow | /*id%3D |
Disallow | /*rss.php |
Disallow | /*sitemap.xml |
Disallow | /*sitemap.xml.gz |
Disallow | /*urllist.txt |
Disallow | /*?p= |
Disallow | /*?s= |
Disallow | /*?id= |
Disallow | /*cgi-bin/ |
Disallow | /*cat_id |
Disallow | /*search.php? |
Disallow | /*% |
Disallow | /*submit.php |
Disallow | /*detail.php |
Disallow | /*details.php |
Disallow | /*details |
Disallow | /*detail |
Disallow | /*_1 |
Disallow | /*_2 |
Disallow | /*_3 |
Disallow | /*_4 |
Disallow | /*_5 |
Disallow | /*_6 |
Disallow | /*_7 |
Disallow | /*_8 |
Disallow | /*_9 |