21stcenturypublishing.com
robots.txt
Robots Exclusion Standard data for 21stcenturypublishing.com
Resource Scan
Scan Details
Site Domain | 21stcenturypublishing.com |
Base Domain | 21stcenturypublishing.com |
Scan Status | Ok |
Last Scan | 4/9/2025, 8:13:00 PM |
Next Scan | 5/9/2025, 8:13:00 PM |
Last Scan
Scanned | 4/9/2025, 8:13:00 PM |
URL | https://21stcenturypublishing.com/robots.txt |
Domain IPs | 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001 |
Response IP | 104.21.112.1 |
Found | Yes |
Hash | ae941a4560acaf2ca29cf74463f99975efd346cbc61cd079abda1532954d04ab |
SimHash | 263f33414dd1 |
Groups
*
Rule | Path |
---|---|
Disallow | /noindex/ |
Disallow | /*? |
Disallow | /vip.html |
Disallow | /*?sort= |
Disallow | /*%26sort%3D |
Disallow | /*?order= |
Disallow | /*%26order%3D |
Disallow | /*?limit= |
Disallow | /*%26limit%3D |
Disallow | /*?filter_name= |
Disallow | /*%26filter_name%3D |
Disallow | /*?filter_sub_category= |
Disallow | /*%26filter_sub_category%3D |
Disallow | /*?filter_description= |
Disallow | /*%26filter_description%3D |
Disallow | /*?tracking= |
Disallow | /*%26page%3D |
Disallow | |
Disallow | /*print%3D |
Disallow | /*tag |
Disallow | /*% |
Disallow | /search* |
Disallow | /*start |
Disallow | /*%3Datom |
Disallow | /*%3Drss |
Disallow | /*print%3D1 |
Disallow | /*? |
Disallow | /*%26 |
Disallow | /404.html |
Disallow | /404/ |
Other Records
Field | Value |
---|---|
sitemap | https://21stcenturypublishing.com/sitemap.xml |
Warnings
- 1 invalid line.