this post was submitted on 17 Jul 2023
25 points (96.3% liked)
Experienced Devs
3961 readers
3 users here now
A community for discussion amongst professional software developers.
Posts should be relevant to those well into their careers.
For those looking to break into the industry, are hustling for their first job, or have just started their career and are looking for advice, check out:
- Logo base by Delapouite under CC BY 3.0 with modifications to add a gradient
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Few reasons. Really good search is hard to do, which means expensive. Reddit probably doesn't have the architecture in place to do a quality search. Not that the architecture couldn't be changed, but it costs money. Another reason is that search engines exist. Why replicate the intricate details of a search engine when it's not your core business? Especially when everyone can use the search engines to search Reddit.
I would think there are plenty of “off the shelf” systems they could buy for this purpose, but they’re too cheap to do it. I mean all the Reddit post and comment data is in a database to begin with.
There are "off the shelf" systems, for a sufficiently broad interpretation of "off the shelf." But they are not cheap (requiring probably a dedicated team just to properly configure and maintain, and probably also requiring significant rearchitecturing of your application's data), and are usually still quite shitty even after all that.
Search is just very, very hard. Much harder than even experienced devs who have not worked in the area appreciate.
Source: I am a dev on a major search engine. No, not that one, but one you have definitely used many times.
I know it's cool to shit on Reddit (and I dislike them too!), but this really is a technical issue. Stuff being in a database doesn't mean that you can magically do good searching without anything. Off the shelf systems exist for off the shelf products, problem with those solutions is that once you differ significantly from the target type of project, it costs more and more to integrate. And since Reddit is pretty unique (if you also account for its scale), it doesn't make financial sense to make a product that's optimized for Reddit.