Or subtitled: Why turn down a chance to completely embarrass myself on the internet?

I read this post on Binary search on the Reinvigorated Programmer blog (In trying to learn more programming, I thought it wouldn’t hurt to read some programming blogs; plus as an unexpected bonus he likes Doctor Who) and thought I’d have a crack at it. However, the rules were a bit hard for me, namely:

“NO TESTING until after you’ve decided your program is correct.”

Even with Ruby, which I’m vaguely familiar with, I knew I couldn’t possibly write even the most basic program without resorting to trying out stuff in IRB. It’s the same as simple maths (times tables, etc) and spelling for me - I make no attempt to memorise this stuff when it’s easy to use a calculator, dictionary, etc. So that’s how my programming works. I can write a vague outline of the logic required in note form, but I need to use IRB or (if in another language) break my programme into chunks to compile and test, etc.

So, an honest a first attempt as possible (with just checking things like “How do ranges work in arrays again?” in IRB (told you it was embarrassing)) would be as follows:

First Attempt

#Assumes a pre-sorted array
def binsearch(array, target)
	m = (array.length/2).floor #round down 
	until array[m] == target
		#Find middle value of array
		m = (array.length/2).floor #round down 
		puts m
		#Which half contains target
		if array[m] > target
			#delete top half
			array = array[0..m]
		else 
			#delete bottom half
			array = array[m..-1]
		end
	end
	puts m
	puts target
	#Do we need to know where we found it?
end

This doesn’t work, the main problem being that I didn’t understand/forgot/didn’t-think-about-it-properly how the loop would check its condition. With m being updated at the start of the loop, it’s too late - the loop has begun so it’ll carry on and never break out of a loop.

Oh well, a bad first guess. At least with that out of the way I could try again, but this time I could test as I went along.

Second Attempt

def binsearch(array, target)
	#Find middle index of array
	m = (array.length/2).floor #round down
	until array[m] == target
		puts m
		#Which half contains target
		if array[m] > target
			#delete top half
			array = array[0..m]
		else 
			#delete bottom half
			array = array[m..-1]
		end
		m = (array.length/2).floor #round down 
	end
	puts "Value found is " + array[m].to_s
	puts "Target was " + target.to_s
	#Do we need to know where we found it?
end

The main change is moving the update of m from top of loop to end. This got it basically working. I then decided (and I don’t think this is a requirement of the task set) to report out the index of the array where the result was found (because to me there didn’t seem to be a lot of point searching for something without knowing where it is).

Third attempt

def binsearch(array, target)
	#Duplicate array so we have a copy of the original
	orig = array.dup

	#Find middle index of array
	m = (array.length/2).floor #round down
	#tracking index
	tidx = m
	until array[m] == target
		puts "m= " + m.to_s
		puts "tidx= " + tidx.to_s
		#Which half contains target
		if array[m] > target
			#delete top half
			array = array[0..m]
			subtractidx = true
		else 
			#delete bottom half
			array = array[m+1..-1]
			subtractidx = false
		end
		m = (array.length/2).floor 
		if subtractidx == true
			tidx = tidx - m + 1
		else
			tidx = tidx + m + 1
		end
	end
	puts "Value found is " + array[m].to_s
	puts "Target was " + target.to_s
	puts "Index of value is " + tidx.to_s
	puts "Value at index is " + orig[tidx].to_s
end

I’d mucked up splitting the arrays and had overlapping ranges, 0..m and m..-1 so fixed that here as well. This attempt seemed to be working ok, but having accidentally seen a bit of the next post that dealt with common bugs in other people’s attempts at this problem, I knew there were things I’d not done: such as dealing with elements not in the array, etc. So that would be the next effort. Fair enough I wouldn’t have got this with my first “paper” attempt anyway, but I would have twigged pretty soon with testing (and as I said, testing is how I develop).

Fourth attempt:

def binsearch(array, target)
	#Duplicate array so we have a copy of the original
	orig = array.dup
	found = true
	#Find middle index of array
	m = (array.length/2).floor #round down
	#tracking index
	tidx = m
	until array[m] == target
		puts "m= " + m.to_s
		puts "tidx= " + tidx.to_s
		#Which half contains target
		if array[m] > target
			#delete top half
			array = array[0..m]
			subtractidx = true
		else 
			#delete bottom half
			array = array[m+1..-1]
			subtractidx = false
		end
		m = (array.length/2).floor 
		if subtractidx == true
			tidx = tidx - m + 1
		else
			tidx = tidx + m + 1
		end
		if array.length == 1 and array[m] != target #if not found
			found = false
			break
		end
	end
	puts found
	if not found
		puts "Value not found"      
	else
		puts "Value found is " + array[m].to_s
		puts "Target was " + target.to_s
		puts "Index of value is " + tidx.to_s
		puts "Value at index is " + orig[tidx].to_s
	end
end

This still wasn’t completely right though. I.e. the index was not always right.

Fifth attempt

def binsearch(array, target)
	#Duplicate array so we have a copy of the original
	orig = array.dup
	found = true
	#Find middle index of array
	m = (array.length/2)-1
	#tracking index
	tidx = m
	until array[m] == target
		puts array.to_s
		puts "m= " + m.to_s
		puts "tidx= " + tidx.to_s
		#Which half contains target
		if array[m] > target
			#delete top half
			array = array[0..m]
			subtractidx = true
		else 
			#delete bottom half
			array = array[m+1..-1]
			subtractidx = false
		end
		m = (array.length/2)-1 
		if subtractidx == true
			tidx = tidx - m - 1 
		else
			tidx = tidx + m
		end
		if array.length == 1 and array[m] != target #if not found
			found = false
			break
		end
	end
	puts found
	if not found
		puts "Value not found"		
	else
		puts array.to_s
		puts "m= " + m.to_s
		puts "tidx= " + tidx.to_s
		puts "Value found is " + array[m].to_s
		puts "Target was " + target.to_s
		puts "Index of value is " + tidx.to_s
		puts "Value at index is " + orig[tidx].to_s
	end
 end

Trying to get the index correct, but still crap. I did discover though that I didn’t need the floor method to round down, as that was happening anyway (see, further embarrassment). Decided to give up figuring this (index tracking) out. Must be a simpler way. Rather than split the array up, just change the range we are looking at:

Sixth (and final-ish) attempt

def binsearch(array, target)
	#Assume search suceeds
	found = true
	#initial lower, upper and middle indexes
	l = 0
	u = array.length-1
	m = (u-l)/2
	until array[m] == target
		#Visualise the search
		puts array[l..u].to_s
		puts "l= " + l.to_s
		puts "m= " + m.to_s
		puts "u= " + u.to_s
		#Which half contains target?
		if array[m] > target
			u = m #Set new upper boundary
		else 
			l = m + 1 #set new lower boundary
		end
		m = (u-l)/2+l #set new mid point
		#Break if not found / out-of-range
		if l==u and array[m] != target
			found = false
			break
		end
	end
	#Visualise search
	puts array[l..u].to_s
	puts "l= " + l.to_s
	puts "m= " + m.to_s
	puts "u= " + u.to_s
	if not found
		puts "Value not found"		
	else
		puts "Value found is " + array[m].to_s
		puts "Target was " + target.to_s
		puts "Index found at is " + m.to_s		
	end
 end

Ok, so pretty happy with that. Seems to work ok. It’s longer than it needs to be thanks to all the puts statements, but I like seeing how it arrives at the solution. Seems to find values, return correct indexes, deal with values not in the array. But what about single element or zero element arrays (also accidentally peeked at)? Whoops no good.

e.g.

binsearch([0], 1)
binsearch([-1], 1)
binsearch([-1], 0)

all just loop continuously, and

binsearch([], 1) 

throws an error.

Zero length arrays are not such a worry, easy to check for that up front (even for me!), but the single element one going into a continuous loop is a more of a concern. A crappy, but simple fix would be:

if array.length == 1
	if array[0] == target
		#Joy to the world
	else
		#Best stop things now
		break
	end
end

But I’m sure there is a more elegant way?

Finally

I’d like to be able to use a get out clause:

I am confident that nearly everyone who reads this blog is already familiar with the binary search algorithm

and say “Well, nope I’m not familiar”, which I’m not, but I did understand the principle of the search. Really this just harks back to school Maths (which was a while ago for me) and iterative searches, etc. “Binary Search” is just different terminology. So no excuse really - I should have been able to figure it out. Perhaps what I should have done was forget the programming language aspect and just write it down logically/mathematically. Ah well, eager beaver and that.

Guess I have a long way to go yet if I want to be a programmer.