weproject.media
robots.txt

Robots Exclusion Standard data for weproject.media

Resource Scan

Scan Details

Site Domain weproject.media
Base Domain weproject.media
Scan Status Ok
Last Scan 2024-11-08T22:54:41+00:00
Next Scan 2024-11-15T22:54:41+00:00

Last Scan

Scanned 2024-11-08T22:54:41+00:00
URL https://weproject.media/robots.txt
Domain IPs 5.178.85.178
Response IP 5.178.85.178
Found Yes
Hash e965cb5568dc358cd160f26d6879d4b35390019664c5e78a3744a4a151c621bd
SimHash 2250bc3f45b0

Groups

*

Rule Path
Allow /bitrix/components/
Allow /bitrix/cache/
Allow /bitrix/js/
Allow /bitrix/templates/
Allow /bitrix/panel/
Disallow */index.php
Disallow /bitrix/
Disallow *show_include_exec_time%3D*
Disallow *show_page_exec_time%3D*
Disallow *show_sql_stat%3D*
Disallow *bitrix_include_areas%3D*
Disallow *clear_cache%3D*
Disallow *clear_cache_session%3D*
Disallow *ADD_TO_COMPARE_LIST*
Disallow *ORDER_BY*
Disallow *?PAGEN*
Disallow *print%3D*
Disallow *view_result%3D*
Disallow /*print_course%3D
Disallow *action%3D*
Disallow *bxajaxid%3D*
Disallow *register%3D*
Disallow *forgot_password%3D*
Disallow *change_password%3D*
Disallow *login%3D*
Disallow *logout%3D*
Disallow *auth%3D*
Disallow *backurl%3D*
Disallow *back_url%3D*
Disallow *BACKURL%3D*
Disallow *BACK_URL%3D*
Disallow *back_url_admin%3D*
Disallow *?utm_source=*
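
The rules above mix Allow and Disallow entries with `*` wildcards, so it may help to see how a crawler could resolve them. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and only a subset of the listed rules is included for illustration; real crawlers may differ in detail.

```python
import re

# A subset of the rules listed above, as (rule, path pattern) pairs.
# Assumption: patterns are compared against the URL path plus query string.
RULES = [
    ("allow", "/bitrix/components/"),
    ("allow", "/bitrix/js/"),
    ("disallow", "*/index.php"),
    ("disallow", "/bitrix/"),
    ("disallow", "*?PAGEN*"),
    ("disallow", "*?utm_source=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`."""
    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; at equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/bitrix/js/main.js")` is true because `/bitrix/js/` is a longer match than `/bitrix/`, while `is_allowed("/bitrix/admin/")` is false because only the Disallow rule matches.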

Other Records

Field Value
sitemap https://weproject.media/sitemap.xml

Warnings

  • `host` is not a known field.