creci-pb.gov.br
robots.txt

Robots Exclusion Standard data for creci-pb.gov.br

Resource Scan

Scan Details

Site Domain creci-pb.gov.br
Base Domain creci-pb.gov.br
Scan Status Ok
Last Scan 2025-05-06T13:08:49+00:00
Next Scan 2025-06-05T13:08:49+00:00

Last Scan

Scanned 2025-05-06T13:08:49+00:00
URL https://creci-pb.gov.br/robots.txt
Domain IPs 104.21.22.139, 172.67.205.29, 2606:4700:3032::6815:168b, 2606:4700:3034::ac43:cd1d
Response IP 172.67.205.29
Found Yes
Hash 3ad572c8a035d7ad938d11e386876ed91230e62801d9c3c160847229e1e73a18
SimHash 194548c0dc1a

Groups

*

Rule Path
Disallow /wp-content/uploads/wpo/wpo-plugins-tables-list.json

*

Rule Path
Disallow /wp-json/
Disallow /?rest_route=
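
The two groups above both target the `*` user agent, so a crawler following RFC 9309 would be expected to treat their rules as one combined set. The following is a minimal sketch of how those three Disallow paths could be checked against a URL. The function name `is_allowed` and the list `DISALLOW_ALL_AGENTS` are hypothetical names chosen for illustration, and the matching is deliberately simplified to plain prefix comparison, which is sufficient here because none of the listed rules use `Allow`, `*`, or `$`.

```python
from urllib.parse import urlsplit

# Disallow paths from the two "*" groups in this scan, merged into one list.
# Assumption: groups naming the same user agent are combined (RFC 9309).
DISALLOW_ALL_AGENTS = [
    "/wp-content/uploads/wpo/wpo-plugins-tables-list.json",
    "/wp-json/",
    "/?rest_route=",
]


def is_allowed(url: str, disallow=DISALLOW_ALL_AGENTS) -> bool:
    """Return False when the URL's path (plus query) starts with a Disallow path.

    Simplified sketch: prefix matching only, no Allow rules and no
    wildcard handling, which covers the three rules in this report.
    """
    parts = urlsplit(url)
    # robots.txt rules are matched against the path and the query string.
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    return not any(target.startswith(rule) for rule in disallow)


print(is_allowed("https://creci-pb.gov.br/wp-json/wp/v2/posts"))       # False
print(is_allowed("https://creci-pb.gov.br/?rest_route=/wp/v2/posts"))  # False
print(is_allowed("https://creci-pb.gov.br/"))                          # True
```

Both blocked examples are the two ways of reaching the WordPress REST API (the `/wp-json/` path and the `?rest_route=` query form). A path such as `/wp-json` without the trailing slash would not match the `/wp-json/` rule under this prefix comparison.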

Other Records

Field Value
sitemap https://creci-pb.gov.br/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK