princetontec.com
robots.txt

Robots Exclusion Standard data for princetontec.com

Resource Scan

Scan Details

Site Domain princetontec.com
Base Domain princetontec.com
Scan Status Ok
Last Scan 2024-10-15T15:35:11+00:00
Next Scan 2024-11-14T15:35:11+00:00

Last Scan

Scanned 2024-10-15T15:35:11+00:00
URL https://princetontec.com/robots.txt
Domain IPs 104.17.144.110, 104.17.145.110, 2606:4700::6811:906e, 2606:4700::6811:916e
Response IP 104.17.145.110
Found Yes
Hash da2632e03449c9f48cc997088489c86705396e0423527abb981c68f2524ecba9
SimHash 107ecd00c763
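
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not say which algorithm the scanner uses or exactly which bytes it hashes. Assuming SHA-256 over the raw response body, a minimal sketch for detecting whether a robots.txt file changed between scans could look like this (the function names `fingerprint` and `has_changed` are hypothetical, chosen for illustration):

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256;
    the report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash.lower()
```

A digest like this only detects that something changed; unlike the SimHash field, it says nothing about how similar the two versions are.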

Groups

bytespider

Rule Path
Disallow /

bytedance

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

geedobot

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

geedoproductsearch

Rule Path
Disallow /
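
The groups above each pair one user-agent token with a single `Disallow: /` rule. As a rough illustration of how such rules are evaluated, the sketch below rebuilds an equivalent robots.txt from the listed tokens and checks it with Python's standard `urllib.robotparser`. The reconstructed file is an approximation of what the scan reports, not a copy of the live file; the helper names `build_robots_txt` and `is_allowed` are hypothetical, and the netEstate entry is left out because its group name is a full user-agent string rather than a simple token.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens taken from the Groups section of the scan report.
# The netEstate group is omitted here (assumption: simple tokens only).
BLOCKED_AGENTS = [
    "bytespider", "bytedance", "coccocbot-web", "dotbot",
    "blexbot", "ccbot", "geedobot", "geedoproductsearch",
]


def build_robots_txt(agents):
    """Rebuild an approximate robots.txt: one 'Disallow: /' group per agent."""
    lines = []
    for agent in agents:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return "\n".join(lines)


# Parse the reconstructed text locally; no network request is made.
parser = RobotFileParser()
parser.parse(build_robots_txt(BLOCKED_AGENTS).splitlines())


def is_allowed(agent, url="https://princetontec.com/"):
    """Return True if the reconstructed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)
```

Because the report lists no `*` group, any crawler not named in the groups falls through to the default and is allowed; for example, `is_allowed("ccbot")` is False while `is_allowed("googlebot")` is True under this reconstruction.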

Comments

  • To block Bytespider from crawling:
  • To block Bytedance from crawling:
  • To block Coccocbot from crawling:
  • To block Dotbot from crawling:
  • To block Common Crawl Bot from crawling:
  • To block Common Crawl Bot from crawling:
  • To block GeedoBot from crawling:
  • Block netEstate NE Crawler (+http://www.website-datenbank.de/)
  • To block GeedoBot from crawling: