Logo

Greetings from The On-Line Encyclopedia of Integer Sequences!

Hints

Search: id:A094913
Displaying 1-1 of 1 results found. page 1
     Format: long | short | internal | text      Sort: relevance | references | number      Highlight: on | off
A094913 Maximal number of distinct nonempty substrings of any binary string of length n. +0
4
0, 1, 3, 5, 8, 12, 16, 21, 27, 34, 42, 50, 59, 69, 80, 92, 105, 119, 134, 150, 166, 183, 201, 220, 240, 261, 283, 306, 330 (list; graph; listen)
OFFSET

1,3

COMMENT

It would be more natural to include the empty substring, which would result in the sequence b(n)=a(n)+1 ; all values computed so far confirm the conjecture that a(n)+1 = A006697(n). - M. F. Hasler, Dec 17 2007

In addition to the example given, computation finds 17 other binary strings of length 6 which contain the maximal number a(6)=16 of distinct substrings. Interestingly, each of the 18 such instances gives the same numbers of substrings of each possible length, in this case 2,4,4,3,2,1 substrings of lengths 1 through 6, respectively.

Calculations suggest that, for any n>0, binary strings of length n exist such that the number of distinct substrings of length k, 1<=k<=n, is as large as possible consistent with basic counting principles, i.e. n-k+1 substrings of length k starting at each of the n-k+1 possible starting positions, subject to the condition, however, that there cannot be more than 2^k distinct binary strings of length k. This suggests the following conjecture. Conjecture. The maximum number a(n) of distinct substrings of a binary string of length n is given by a(n)=Sum[t(k), k=1..n] where t(k) is the smaller of 2^k and n-k+1.

Conjecture: a(n) = A006697(n)-1. - Vladeta Jovovic (vladeta(AT)Eunet.yu), Sep 19 2005

Conjecture: a(n) = 2^(m+1)-2 + (n-m)(n-m+1)/2, where m = [ n+1-LambertW( 2^(n+1) * log(2) ) / log(2) ] = integer part of the solution to 2^x = n+1-x. This is is based on the reasoning that you can construct the word of length n so that it contains the maximal possible number, max( n-k+1 , 2^k ), of different substrings of length k. - M. F. Hasler, Dec 17 2007

Comment from Peter Pein (petsie(AT)dordos.net), Jan 18 2008 (Start): The following heuristic seems to wrk for computing maximal number of distinct substrings of a binary string of length n.

Start with the empty list and at each step try to insert a zero or an one at each possible position. Then pick the first list with maximal number of sublists and start over.

Say we have had {0,0,1,1,0} as one of the lists with the maximal number of sublists. Then my candidates for the next test are:

With[{lastbest = {0, 0, 1, 1, 0}}, Union[Flatten[ Outer[Insert[lastbest, #2, #1] & , Range[1 + Length[lastbest]], {0, 1}, 1], 1]]]

{{0, 0, 0, 1, 1, 0}, {0, 0, 1, 0, 1, 0}, {0, 0, 1, 1, 0, 0}, {0, 0, 1, 1, 0, 1}, {0, 0, 1, 1, 1, 0}, {0, 1, 0, 1, 1, 0}, {1, 0, 0, 1, 1, 0}}

See http://freenet-homepage.de/Peter_Berlin/Mathe/heuristicA094913.nb for the Mathematica notebook with the complete (simple) algorithm. There's a screenshot too (same URL but with .png instead of .nb).

If this works correctly, the first 100 values of A094913 are calculated in 30 secs. (End) [See also the remarks in the Fuller link in A134457]

EXAMPLE

The binary string 000110 of length 6 contains the 16 distinct substrings 0, 1, 00, 01, 11, 10, 000, 001, 011, 110, 0001, 0011, 0110, 00011, 00110, 000110 and a computer search shows that no other binary string of length 6 contains more than 16. Thus a(6)=16.

CROSSREFS

Cf. A006697, A134457, A134466

Adjacent sequences: A094910 A094911 A094912 this_sequence A094914 A094915 A094916

Sequence in context: A023660 A023562 A000212 this_sequence A020678 A014811 A131674

KEYWORD

nonn,nice

AUTHOR

John W. Layman (layman(AT)math.vt.edu), Jun 17 2004

EXTENSIONS

a(19)-a(28) from David W. Wilson (wilson.d(AT)anseri.com), Dec 17 2007

page 1

Search completed in 0.002 seconds

Lookup | Welcome | Find friends | Music | Plot 2 | Demos | Index | Browse | More | WebCam
Contribute new seq. or comment | Format | Transforms | Puzzles | Hot | Classics
More pages | Superseeker | Maintained by N. J. A. Sloane (njas@research.att.com)

Last modified May 16 01:24 EDT 2008. Contains 139630 sequences.


AT&T Labs Research