If you ever need to capture the smallest or largest "n" from a stream of data, the approach more often than not will be to use a simple data-structure called the Priority Queue.
Priority Queues do one thing very well - once a bunch of data is added, it can return the lowest value (or the highest value) in constant time.
How is this useful to answer a top or bottom "n" type question. Let's see.
Consider this hypothetical stream of data:
- with a size limit of 2
- which returns the largest of these 2 when asked for (sometimes referred to as a Max Priority Queue)
Two considerations as data streams in:
- if the size of the priority queue is less than 2 then add the value to the priority queue
- if the size of the priority queue is equal to 2, then compare the value from the stream with the largest in the queue
- if less then remove the largest and add the new value
- if more then do nothing
At the bottom is the state of the Priority Queue as each data in the stream is processed:
See how it always holds the bottom 2.
Similarly for the largest 3, a Priority Queue with a max capacity of 3 which returns the smallest (referred to as a Min Priority Queue) can be used the following way:
- if the size of the priority queue is less than 3, then add to the priority queue
- if the size is equal to 2, then check the value from the stream with the smallest in the queue
- if more then remove smallest add the value from stream and ignore otherwise
Implementation
Here is a simple kotlin based implementation that uses the built in PriorityQueue in Java standard library.
fun findNSmallestAndLargest(nums: List<Int>, n: Int): Pair<List<Int>, List<Int>> {
val minFirst: Comparator<Int> = Comparator.naturalOrder<Int>()
val maxFirst: Comparator<Int> = minFirst.reversed()
val minPq: PriorityQueue<Int> = PriorityQueue(minFirst)
val maxPq: PriorityQueue<Int> = PriorityQueue(maxFirst)
for (num in nums) {
checkAndAddIfSmallest(maxPq, n, num)
checkAndAddIfLargest(minPq, n, num)
}
return maxPq.toList() to minPq.toList()
}
private fun checkAndAddIfSmallest(maxPq: PriorityQueue<Int>, n: Int, num: Int) {
if (maxPq.size < n) {
maxPq.add(n)
} else if (num < maxPq.peek()) {
maxPq.poll()
maxPq.add(num)
}
}
private fun checkAndAddIfLargest(minPq: PriorityQueue<Int>, n: Int, num: Int) {
if (minPq.size < n) {
minPq.add(n)
} else if (num > minPq.peek()) {
minPq.poll()
minPq.add(num)
}
}
The implementation is very straightforward and follows the outlined algorithm to the letter.