novacustomboxes.com
robots.txt

Robots Exclusion Standard data for novacustomboxes.com

Resource Scan

Scan Details

Site Domain novacustomboxes.com
Base Domain novacustomboxes.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-10-18T01:04:27+00:00
Next Scan 2025-11-01T01:04:27+00:00

Last Successful Scan

Scanned 2025-10-03T00:47:29+00:00
URL https://novacustomboxes.com/robots.txt
Redirect https://www.novacustomboxes.com/robots.txt
Redirect Domain www.novacustomboxes.com
Redirect Base novacustomboxes.com
Domain IPs 2a02:4780:b:673:0:1c54:2c70:6, 46.202.196.61
Redirect IPs 2a02:4780:b:673:0:1c54:2c70:6, 46.202.196.61
Response IP 46.202.196.61
Found Yes
Hash fdb057f18eb64f977279346717798cc07388e1fb749a6de984ea0bfb794a8137
SimHash 3f565392c33b

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 20
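
The two rules in this group overlap, since /wp-admin/admin-ajax.php sits under the disallowed /wp-admin/ prefix. A minimal sketch of how that could resolve, assuming a crawler that follows the longest-match rule described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The function name `is_allowed` and the list `WILDCARD_GROUP` are hypothetical names used for illustration; only the two rules listed in this group are considered, and real crawlers may behave differently.

```python
def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    `rules` is a list of (kind, prefix) pairs, where kind is "allow"
    or "disallow". Plain prefix matching only; no wildcards here.
    """
    best_kind, best_len = "allow", -1  # no matching rule means allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_kind, best_len = kind, len(prefix)
    return best_kind == "allow"


# Hypothetical container for the two rules shown in this group.
WILDCARD_GROUP = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

print(is_allowed("/wp-admin/admin-ajax.php", WILDCARD_GROUP))
print(is_allowed("/wp-admin/options.php", WILDCARD_GROUP))
```

Under these assumptions the longer Allow path takes precedence for admin-ajax.php, while other paths under /wp-admin/ remain disallowed.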

openai

Rule Path
Disallow /

chatgpt-3

Rule Path
Disallow /
Disallow /wp-json/
Disallow /?rest_route=
Disallow /search/
Disallow /?s=

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /wp-content/uploads/

applebot

Rule Path
Allow /

yandex

Rule Path
Allow /

yandeximages

Rule Path
Allow /wp-content/uploads/

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

qwantify

Rule Path
Allow /

baiduspider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sosospider+

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

facebook

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

facebookscraper

Rule Path
Allow /

facebot

Rule Path
Allow /

instagrambot

Rule Path
Allow /

whatsapp bot

Rule Path
Allow /

telegrambot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

pinterest bot

Rule Path
Allow /

discordbot

Rule Path
Allow /

*

Rule Path
Disallow /*.pdf$

*

Rule Path
Disallow /*.docx$

*

Rule Path
Disallow /*.html$

*

Rule Path
Disallow /*.php$
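
The four file-type rules above use the `*` wildcard and the `$` end-of-path anchor. As a rough sketch of how such patterns could be evaluated, the following translates a rule path into a regular expression, assuming the common interpretation in which `*` matches any run of characters and a trailing `$` pins the match to the end of the URL path. The helper names `pattern_to_regex` and `matches` are hypothetical, and the sample paths are made up for illustration.

```python
import re


def pattern_to_regex(pattern):
    """Translate a robots.txt rule path into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # \Z pins the end of the string; without it the rule is a prefix match.
    return re.compile(body + (r"\Z" if anchored else ""))


def matches(pattern, path):
    """Return True if the rule path `pattern` applies to `path`."""
    return pattern_to_regex(pattern).match(path) is not None


print(matches("/*.pdf$", "/files/catalog.pdf"))
print(matches("/*.pdf$", "/files/catalog.pdf?download=1"))
print(matches("/*.html$", "/about.html"))
```

Because of the `$` anchor, a URL that carries a query string after the extension would not match the rule under this interpretation.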

Other Records

Field Value
sitemap https://www.novacustomboxes.com/sitemap_index.xml
sitemap https://www.novacustomboxes.com/post-sitemap.xml
sitemap https://www.novacustomboxes.com/page-sitemap.xml
sitemap https://www.novacustomboxes.com/product-sitemap1.xml
sitemap https://www.novacustomboxes.com/product-sitemap2.xml
sitemap https://www.novacustomboxes.com/category-sitemap.xml
sitemap https://www.novacustomboxes.com/product_cat-sitemap.xml
sitemap https://www.novacustomboxes.com/news-sitemap.xml
sitemap https://www.novacustomboxes.com/video-sitemap.xml
sitemap https://www.novacustomboxes.com/local-sitemap.xml
sitemap https://www.novacustomboxes.com/sitemap.xml

Comments

  • Block ChatGPT Crawler
  • Prevent Crawling of WordPress JSON API Endpoints
  • Block Search URLs /search/ and /?s=
  • Block Moz Crawler
  • Block Majestic Crawler
  • Block archive.org bots
  • Rankmath Sitemap Link
  • Allow Google Bot
  • Allow Google Images Bot
  • Allow Google Media Partners Bot
  • Allow Google AdsBot Bot
  • Allow Google Mobile Bot
  • Allow Bing Bot
  • Allow MSN Bot
  • Allow MSNBot Media Bot
  • Allow Apple Bot
  • Allow Yandex Bot
  • Allow Yandex Images Bot
  • Allow Yahoo Search (Slurp bot)
  • Allow DuckDuckGo Bot
  • Allow Qwant Bot
  • Block Baidu/Sogou/Soso/Youdao Bot
  • Block Naver Bot
  • Block Seznam Bot
  • Allow Facebook Bot
  • Allow Instagram Bot
  • Allow Whatsapp Bot
  • Allow Telegram Bot
  • Allow Twitter Bot
  • Allow Linkedin Bot
  • Allow Pinterest Bot
  • Allow Discord Bot
  • Block PDF Files
  • Block DOCX Files
  • Block Html Files
  • Block Php Files