I should have started with this:
I am working on a search engine, #babelia. Unlike a todo app, it requires more hardware. Having an idea of how much people are willing to spend, either upfront or as a subscription to rent a server, will help me figure out what kind of hardware I need to acquire to test the algorithms.
Thanks for your cooperation! (and please boost!)
I think your low numbers are too high, both for hardware and subscription.
I pay less than 10 bucks for mobile Internet. Yet, somehow, the search engines and all the ad-loaded pages (plus all the hundreds of layers of ad maf... industry) make a living from eating my bandwidth.
It should be 1€. 3 max.
Thanks for chiming in!
Pay-per-search does not work as the only revenue to cover infrastructure, because you need to pay for hosting / renting anyway so that the server is available whenever someone needs it.
You reminded me that the server sits idle most of the day: if there are 5 people each doing 100 queries per day, and each query takes less than 1 second with all CPU cores maxed out, the server is fully busy for at most 500 seconds per day. That leaves at least 85900 seconds (~23.86 hours) available on all CPU cores...
What if the server provides more "services" or "applications" like #NextCloud? Would 10€ per month per user make sense?
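To double-check that arithmetic, here is a minimal Python sketch. The function name `idle_seconds` is made up for illustration, and it assumes the worst case where every query takes a full second and queries never overlap:

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86400


def idle_seconds(users: int, queries_per_user: int, seconds_per_query: float) -> float:
    """Seconds per day the server is NOT busy answering queries.

    Assumes queries run one at a time and each one maxes out all cores.
    """
    busy = users * queries_per_user * seconds_per_query
    return SECONDS_PER_DAY - busy


# Numbers from the post: 5 people, 100 queries each, at most 1 second per query.
idle = idle_seconds(users=5, queries_per_user=100, seconds_per_query=1.0)
print(idle)                    # 85900.0 seconds idle
print(round(idle / 3600, 2))   # 23.86 hours idle
```

Changing the number of users or queries shows how many people a single server could serve before it stops being mostly idle.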
The features of the search engine part only: https://etherpad.wikimedia.org/p/babelia
And that does not take into account user feedback, which would make it possible to steer the crawl from the search, e.g. via a feed reader.
It also does not mention interop between search engines, which we dive into at https://etherpad.wikimedia.org/p/peacesearch
Thanks for chiming in!
A worker system like the one you describe is much more complicated to develop. Also, in such a system people need to trust each other not to game the crawl results.
Bandwidth and crawling are necessary to seed the index and then keep it fresh. The initial index can be shared (like Common Crawl Search Engine does, but with less spam). Keeping the index fresh for a family is doable on a regular fiber or DSL connection; it does not consume much.
Storage is not even costly: English Wikipedia + Stack Overflow does not even reach 100GB.
What is costly is answering a query in under one second. For that you need lots of CPU cores / threads, like AMD EPYC or Threadripper.
Maybe I did not understand what you wrote?