cpecn.com
robots.txt

Robots Exclusion Standard data for cpecn.com

Resource Scan

Scan Details

Site Domain cpecn.com
Base Domain cpecn.com
Scan Status Ok
Last Scan2024-10-08T10:50:50+00:00
Next Scan 2024-11-07T10:50:50+00:00

Last Scan

Scanned2024-10-08T10:50:50+00:00
URL https://cpecn.com/robots.txt
Domain IPs 50.56.2.116
Response IP 50.56.2.116
Found Yes
Hash e81b3a5418700c5edb88ac90ea9b219e5a1e07b794dd0c4d2a4e1c38a79c9577
SimHash 7b6fd868e812

Groups

*

Rule Path
Disallow *?p=*
Disallow /wp-includes/
Disallow /wp-login.php
Disallow /wp-admin/*
Disallow /wp-register.php
Disallow /wp-content/themes/pubx/includes/*
Disallow */tag/*
Disallow *?s=*
Disallow /search/*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cpecn.com/sitemap_index.xml