geobear.pl
robots.txt

Robots Exclusion Standard data for geobear.pl

Resource Scan

Scan Details

Site Domain	geobear.pl
Base Domain	geobear.pl
Scan Status	Ok
Last Scan	2025-10-14T04:32:19+00:00
Next Scan	2025-11-13T04:32:19+00:00

Last Scan

Scanned	2025-10-14T04:32:19+00:00
URL	https://geobear.pl/robots.txt
Domain IPs	104.21.1.234, 172.67.152.146, 2606:4700:3032::ac43:9892, 2606:4700:3036::6815:1ea
Response IP	104.21.1.234
Found	Yes
Hash	65d528bc09acb5c63d3ef4d771b11a49742285c36592cdc8ad3d23afd19265e0
SimHash	6b509d4a8d35

Groups

*

Rule	Path
Allow	/wp-admin/admin-ajax.php
Allow	/*/*.css
Allow	/*/*.js
Disallow	/wp-admin/
Disallow	/wp-includes/
Disallow	/readme.html
Disallow	/license.txt
Disallow	/xmlrpc.php
Disallow	/wp-login.php
Disallow	/wp-register.php
Disallow	*/disclaimer/*
Disallow	*?attachment_id=
Disallow	/*.pdf
Allow	/*subsidence-map*.pdf
Allow	/thank-you-residential-extra-info/
Allow	/residential-thanks/
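
The group above mixes Allow and Disallow patterns that overlap (for example `/*.pdf` and `/*subsidence-map*.pdf`). The following is a minimal sketch of how a crawler might resolve such overlaps, assuming the longest-match precedence described in RFC 9309, where `*` matches any run of characters, a trailing `$` anchors the end, and Allow wins a tie. The function names and the sample paths are hypothetical and are not taken from the scan.

```python
import re

# Rules copied from the first "*" group listed in the scan.
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*/*.css"),
    ("allow", "/*/*.js"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/readme.html"),
    ("disallow", "/license.txt"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("disallow", "*/disclaimer/*"),
    ("disallow", "*?attachment_id="),
    ("disallow", "/*.pdf"),
    ("allow", "/*subsidence-map*.pdf"),
    ("allow", "/thank-you-residential-extra-info/"),
    ("allow", "/residential-thanks/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally, starting at the first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

In this sketch, `is_allowed("/wp-admin/admin-ajax.php")` is true because the Allow pattern is longer than `/wp-admin/`, while a hypothetical `/docs/report.pdf` is blocked by `/*.pdf`. Real crawlers may differ in details such as percent-encoding and tie-breaking.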

*

Rule	Path
Allow	/

gptbot

Rule	Path
Allow	/

google-extended

Rule	Path
Allow	/

claudebot

Rule	Path
Allow	/

perplexitybot

Rule	Path
Allow	/
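
The scan lists a wildcard group alongside named groups for individual crawlers. As a rough sketch of how a crawler might pick which group applies to it, assuming the RFC 9309 behaviour of a case-insensitive match on the product token with a fallback to `*`, consider the following. The function name and the `bingbot` example token are hypothetical.

```python
# Group names as they appear in the scan.
GROUP_NAMES = ["*", "gptbot", "google-extended", "claudebot", "perplexitybot"]


def select_group(product_token, group_names=GROUP_NAMES):
    """Return the name of the group a crawler with this token would follow.

    A named group is preferred when it equals the token (ignoring case);
    otherwise the wildcard group is used if present, else None.
    """
    token = product_token.strip().lower()
    for name in group_names:
        if name != "*" and name == token:
            return name
    return "*" if "*" in group_names else None
```

Under this assumption, a crawler that identifies as `GPTBot` would follow only its own `Allow /` group, so the Disallow lines in the `*` group would not apply to it, while an unlisted crawler such as `bingbot` would fall back to `*`.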

Other Records

Field	Value
sitemap	https://www.geobear.co.uk/sitemap_index.xml

Comments

Instructions for all web crawlers, including AI agents.
This file is the current web standard.
Explicit allow rules for major AI crawlers