lemm.ee

Local All Communities Log in Sign up

Local All Communities

97

Big O notation is about what matters when the numbers get big.(programming.dev)

posted 1 year ago

*

by

ericjmorey@programming.dev

in

programming@beehaw.org

cross-posted from: https://programming.dev/post/6660679

It’s about asking, “how does this algorithm behave when the number of elements is significantly large compared to when the number of elements is orders of magnitude larger?”

Big O notation is useless for smaller sets of data. Sometimes it’s worse than useless, it’s misguiding. This is because Big O is only an estimate of asymptotic behavior. An algorithm that is O(n^2) can be faster than one that’s O(n log n) for smaller sets of data (which contradicts the table below) if the O(n log n) algorithm has significant computational overhead and doesn’t start behaving as estimated by its Big O classification until after that overhead is consumed.

#computerscience

Image Alt Text:

"A graph of Big O notation time complexity functions with Number of Elements on the x-axis and Operations(Time) on the y-axis.

Lines on the graph represent Big O functions which are are overplayed onto color coded regions where colors represent quality from Excellent to Horrible

Functions on the graph:
O(1): constant - Excellent/Best - Green
O(log n): logarithmic - Good/Excellent - Green
O(n): linear time - Fair - Yellow
O(n * log n): log linear - Bad - Orange
O(n^2): quadratic - Horrible - Red
O(n^3): cubic - Horrible (Not shown)
O(2^n): exponential - Horrible - Red
O(n!): factorial - Horrible/Worst - Red"

Source

Sort:

Hot Top Controversial New Old

[ +- ]

off_brand_@beehaw.org

22 points

1 year ago

Yup, it’s why O(N+10) and even O(2N) are effectively the same as O(N) on your CS homework. Speaking too generally, once you’re dithering over the efficiency of an algorithm processing a 100-item dataset you’ve probably gone too far in weeds. And optimizations can often lead to messy code for not a lot of return.

That’s mostly angled at new grads (or maybe just at me when I first started). You’ve probably got bigger problems to solve than shaving a few ms from the total runtime of your process.

report

reply

[ +- ]

reboot6675@sopuli.xyz

22 points

1 year ago

Just a little nitpicking, O(N+10) and O(2N) are not “effectively” the same as O(N). They are truly the same.

report

reply

[ +- ]

off_brand_@beehaw.org

2 points

1 year ago

Deleted by creator

report

reply

[ +- ]

Scrubbles@poptalk.scrubbles.tech

18 points

1 year ago

This is why I only use bogo sort

report

reply

[ +- ]

ericjmorey@programming.devOP

10 points

1 year ago

With a small enough data set, bogo sort will perform just as well as an O(1) algorithm for sorting for both ascending and descending order!

report

reply

[ +- ]

Scrubbles@poptalk.scrubbles.tech

4 points

1 year ago

That’s using your noodle!

report

reply

[ +- ]

ericjmorey@programming.devOP

4 points

1 year ago

Use it or lose it, right?

report

reply

[ +- ]

mryessir@lemmy.sdf.org

3 points

1 year ago

O(n)

report

reply

[ +- ]

HarkMahlberg@kbin.social

11 points

1 year ago

Aren’t the most commonly accepted sorting algorithms O(nlog(n))? Quicksort? Mergesort? Those are considered bad?

report

reply

[ +- ]

Cawifre@beehaw.org

14 points

1 year ago

I mean, it is entirely reasonable that “bad” is the best performance you can hope for while sorting an entire set of generally comparable items.

If you can abuse special knowledge about the data being sorted then you can get better performance with things like radix sort, but in general it just takes a lot of work to compare them all even if you are clever to avoid wasted effort.

report

reply

[ +- ]

Kajo [he/him] 🌈@beehaw.org

12 points

1 year ago

Yeah, you’re right, it doesn’t make sense to say that O(f(n)) is good or bad for any algorithm. It must be compared to the complexity of other algorithms which solve the same problem in the same conditions.

report

reply

[ +- ]

duncesplayed@lemmy.one

6 points

11 months ago

I mean…yeah. Just because something is provably the best possible thing, doesn’t mean it’s good. Sorting should be avoided if at all possible. (And in many cases, such as with numbers, you can do better than comparison-based sorts)

report

reply

[ +- ]

ericjmorey@programming.devOP

1 point

11 months ago

The labels are from the perspective of viewing the space of all possible functions of element set size to operations, so they don’t apply to any particular problem an algorithm is attempting to solve (that space is often smaller).

report

reply

[ +- ]

luciole (he/him)@beehaw.org

10 points

1 year ago

There’s some simple rules of thumb that can be inferred from Big O notation. For example, I try to be careful with what I do in nested loops whose number of repetitions will grow with application usage. I might be stepping into O(n^2) territory, this is not the place for database queries.

report

reply

[ +- ]

douglasg14b@beehaw.org

1 point

11 months ago

TFW you’re storing 2D and 3D data structures and you read this 😂

report

reply

[ +- ]

luciole (he/him)@beehaw.org

1 point

11 months ago

I don’t see the contradiction. If you’re doing CRUD operations on 10,000 points, I’m sure you’re doing what’s possible to send it to storage in one fell swoop. You most probably get out of those loops any operation that doesn’t need to be repeated as well.

report

reply

[ +- ]

chunkystyles@sopuli.xyz

5 points

1 year ago

I’ve been wrong about the performance of algorithms on tiny data sets before. It’s always best to test your assumptions.

report

reply

Programming

!programming@beehaw.org

All things programming and coding related. Subcommunity of Technology.

This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Community stats

69
Monthly active users
320
Posts
3.3K
Comments

Community moderators

alyaza [they/she]@beehaw.org
Hexorg@beehaw.org
Chris Remington@beehaw.org

modlog legal instances join-lemmy.org

lemmy-ui-next v0.11.0 (github)lemmy v0.19.5 (github)