app.duedil.com
robots.txt

Robots Exclusion Standard data for app.duedil.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	app.duedil.com
Base Domain	duedil.com
Scan Status	Ok
Last Scan	2025-10-12T04:55:12+00:00
Next Scan	2025-11-11T04:55:12+00:00

Last Scan

Scanned	2025-10-12T04:55:12+00:00
URL	https://app.duedil.com/robots.txt
Domain IPs	104.26.14.91, 104.26.15.91, 172.67.70.225
Response IP	172.67.70.225
Found	Yes
Hash	86f8c4d66f1e51f990a7e672b266cc8b03ec8430bc9f582cfee11727119c3d7a
SimHash	ba971d0acd74

Groups

*

Rule	Path
Disallow	/

Rule

Path

Disallow

/

seekbot/1.0*

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/includes/
Disallow	/misc/
Disallow	/modules/
Disallow	/profiles/
Disallow	/scripts/
Disallow	/themes/
Disallow	/CHANGELOG.txt
Disallow	/cron.php
Disallow	/INSTALL.mysql.txt
Disallow	/INSTALL.pgsql.txt
Disallow	/install.php
Disallow	/INSTALL.txt
Disallow	/LICENSE.txt
Disallow	/MAINTAINERS.txt
Disallow	/update.php
Disallow	/UPGRADE.txt
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/contact/
Disallow	/logout/
Disallow	/node/add/
Disallow	/user/register/
Disallow	/user/password
Disallow	/signup_link_share
Disallow	/signup/
Disallow	/verify_signup/
Disallow	/user
Disallow	/user/
Disallow	/ajax_fast_offsite/
Disallow	/ajax_slow_offsite/
Disallow	/linkedin/authenticate
Disallow	/login/
Disallow	/*edit$
Disallow	*/contact$
Disallow	/lists/introduction/
Disallow	*/map$
Disallow	/company////contact$
Disallow	/company////group$
Disallow	/company////documents$
Disallow	/company////risk$
Disallow	/company////activity$
Disallow	/company////news$
Disallow	/company////financials$
Disallow	/company////ownership$
Disallow	*/ownership/parent-companies$
Disallow	/?q=admin%2F
Disallow	/?q=comment%2Freply%2F
Disallow	/?q=contact%2F
Disallow	/?q=logout%2F
Disallow	/?q=node%2Fadd%2F
Disallow	/?q=search%2F
Disallow	/?q=user%2Fpassword%2F
Disallow	/?q=user%2Fregister%2F
Disallow	/?q=user%2Flogin%2F
Disallow	?status=
Disallow	?view=
Disallow	?to=
Disallow	/company/fr/
Disallow	/company/de/
Disallow	/company/be/
Disallow	/company/nl/
Disallow	/company/lu/
Disallow	/company/no/
Disallow	/company/se/
Disallow	/company/fi/
Disallow	/company/it/
Disallow	/company/pt/
Disallow	/company/hu/
Disallow	/company/pl/
Disallow	/company/lt/
Disallow	/company/sk/
Disallow	/company/mt/
Disallow	/company/is/

Rule

Path

Disallow

/includes/

Disallow

/misc/

Disallow

/modules/

Disallow

/profiles/

Disallow

/scripts/

Disallow

/themes/

Disallow

/CHANGELOG.txt

Disallow

/cron.php

Disallow

/INSTALL.mysql.txt

Disallow

/INSTALL.pgsql.txt

Disallow

/install.php

Disallow

/INSTALL.txt

Disallow

/LICENSE.txt

Disallow

/MAINTAINERS.txt

Disallow

/update.php

Disallow

/UPGRADE.txt

Disallow

/admin/

Disallow

/comment/reply/

Disallow

/contact/

Disallow

/logout/

Disallow

/node/add/

Disallow

/user/register/

Disallow

/user/password

Disallow

/signup_link_share

Disallow

/signup/

Disallow

/verify_signup/

Disallow

/user

Disallow

/user/

Disallow

/ajax_fast_offsite/

Disallow

/ajax_slow_offsite/

Disallow

/linkedin/authenticate

Disallow

/login/

Disallow

/*edit$

Disallow

*/contact$

Disallow

*/lists/introduction/*

Disallow

*/map$

Disallow

*/company/*/*/*/contact$

Disallow

*/company/*/*/*/group$

Disallow

*/company/*/*/*/documents$

Disallow

*/company/*/*/*/risk$

Disallow

*/company/*/*/*/activity$

Disallow

*/company/*/*/*/news$

Disallow

*/company/*/*/*/financials$

Disallow

*/company/*/*/*/ownership$

Disallow

*/ownership/parent-companies$

Disallow

/?q=admin%2F

Disallow

/?q=comment%2Freply%2F

Disallow

/?q=contact%2F

Disallow

/?q=logout%2F

Disallow

/?q=node%2Fadd%2F

Disallow

/?q=search%2F

Disallow

/?q=user%2Fpassword%2F

Disallow

/?q=user%2Fregister%2F

Disallow

/?q=user%2Flogin%2F

Disallow

*?status=*

Disallow

*?view=*

Disallow

*?to=*

Disallow

*/company/fr/*

Disallow

*/company/de/*

Disallow

*/company/be/*

Disallow

*/company/nl/*

Disallow

*/company/lu/*

Disallow

*/company/no/*

Disallow

*/company/se/*

Disallow

*/company/fi/*

Disallow

*/company/it/*

Disallow

*/company/pt/*

Disallow

*/company/hu/*

Disallow

*/company/pl/*

Disallow

*/company/lt/*

Disallow

*/company/sk/*

Disallow

*/company/mt/*

Disallow

*/company/is/*

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

companybook-crawler*

Rule	Path
Disallow	/
Disallow	/legal/enterprise/april2018terms

Rule

Path

Disallow

/

Disallow

/legal/enterprise/april2018terms

Back to top

Comments

$Id: robots.txt,v 1.9.2.2 2010/09/06 10:37:16 goba Exp $
robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
DueDil robots.txt - Complete no-index configuration
Prevent all search engines from crawling and indexing the entire site
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
NoIndex
International
Directors
Misc

Back to top

Warnings

`noindex` is not a known field.

Back to top

app.duedil.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

seekbot/1.0*

*

Other Records

companybook-crawler*

Comments

Warnings

app.duedil.com
robots.txt