njbiz.com
robots.txt

Robots Exclusion Standard data for njbiz.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	njbiz.com
Base Domain	njbiz.com
Scan Status	Ok
Last Scan	2024-06-05T01:04:49+00:00
Next Scan	2024-06-12T01:04:49+00:00

Last Scan

Scanned	2024-06-05T01:04:49+00:00
URL	https://njbiz.com/robots.txt
Domain IPs	141.193.213.20, 141.193.213.21
Response IP	141.193.213.20
Found	Yes
Hash	20586a13416cc9251aba6b9ad35627d60dd74cdcf8eff961c2b51e9eca85e19f
SimHash	891848414673

Groups

*

Rule	Path	Comment
Disallow	/cgi-bin/	-
Disallow	/wp-admin/	-
Disallow	/wp-includes/	-
Disallow	/wp-content/plugins/	-
Disallow	/wp-content/cache/	-
Disallow	/wp-content/themes/	-
Allow	/wp-content/uploads/	-
Disallow	/feed/	-
Disallow	/trackback/	-
Disallow	/print/	wp-print block
Disallow	/index.php	separate directive for the main script file of WP
Disallow	/*?	search results
Disallow	/*.php$	-
Disallow	/*.js$	-
Disallow	/*.inc$	-
Disallow	/*.css$	-
Disallow	*/feed/	-
Disallow	*/trackback/	-
Disallow	*/print/	-
Disallow	/maryland-family-law/wp-files/family-law/*.pdf$	-
Disallow	/maryland-family-law/files/2015/9/*.pdf$	-
Disallow	/maryland-family-law/*.pdf$	-

Rule

Path

Comment

Disallow

/cgi-bin/

Disallow

/wp-admin/

Disallow

/wp-includes/

Disallow

/wp-content/plugins/

Disallow

/wp-content/cache/

Disallow

/wp-content/themes/

Allow

/wp-content/uploads/

Disallow

/feed/

Disallow

/trackback/

Disallow

/print/

wp-print block

Disallow

/index.php

separate directive for the main script file of WP

Disallow

/*?

search results

Disallow

/*.php$

Disallow

/*.js$

Disallow

/*.inc$

Disallow

/*.css$

Disallow

*/feed/

Disallow

*/trackback/

Disallow

*/print/

Disallow

/maryland-family-law/wp-files/family-law/*.pdf$

Disallow

/maryland-family-law/files/2015/9/*.pdf$

Disallow

/maryland-family-law/*.pdf$

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

User-agent: Googlebot-Image
Disallow:
Allow: /
User-agent: Mediapartners-Google
Disallow:
Allow: /
Sitemap: http://yourdomain.com/sitemap.xml

njbiz.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

gptbot

ccbot

anthropic-ai

claude-web

google-extended

Comments

njbiz.com
robots.txt