newcinemaschool.com
robots.txt

Robots Exclusion Standard data for newcinemaschool.com

Resource Scan

Scan Details

Site Domain newcinemaschool.com
Base Domain newcinemaschool.com
Scan Status Ok
Last Scan2024-05-13T18:35:19+00:00
Next Scan 2024-06-12T18:35:19+00:00

Last Scan

Scanned2024-05-13T18:35:19+00:00
URL https://newcinemaschool.com/robots.txt
Domain IPs 2a00:15f8:a000:5:1:11:5:fab6, 2a00:15f8:a000:5:1:12:5:fab6, 2a00:15f8:a000:5:1:13:5:fab6, 2a00:15f8:a000:5:1:14:5:fab6, 90.156.201.111, 90.156.201.34, 90.156.201.39, 90.156.201.59
Response IP 90.156.201.34
Found Yes
Hash 31b37f1dda00aa8fccbbcbae91a8a64d18f1ae936053c09e039bc6951a9b0a03
SimHash 634088c68e92

Groups

*

Rule Path
Disallow /assets/backup/
Disallow /assets/cache/
Disallow /assets/docs/
Disallow /assets/export/
Disallow /assets/import/
Disallow /assets/modules/
Disallow /assets/plugins/
Disallow /assets/snippets/
Disallow /assets/packages/
Disallow /assets/tvs/
Disallow /install/
Allow /assets/cache/images/
Allow /assets/modules/*.css
Allow /assets/modules/*.js
Allow /assets/plugins/*.css
Allow /assets/plugins/*.js
Allow /assets/snippets/*.css
Allow /assets/snippets/*.js

Other Records

Field Value
sitemap https://www.newcinemaschool.com/sitemap.xml

Comments

  • Default modx exclusions

Warnings

  • `host` is not a known field.