Jonathan Ellis's profile photo

Jonathan Ellis

Articles

  • Jun 18, 2024 | dev.to | Jonathan Ellis

    In November, Cohere released a dataset containing all of Wikipedia, chunked and embedded to vectors with their multilingual-v3 model. Computing this many embeddings yourself would cost in the neighborhood of $5000, so the public release of this dataset makes creating a semantic, vector-based index of Wikipedia practical for an individual for the first time. Here’s what we’re building: You can try searching the completed index on a public demo instance here.

  • Apr 1, 2024 | thenewstack.io | Jonathan Ellis |Loraine Lawson |Joab Jackson |Kimberley Mok

    Vector indexes are the hottest topic in databases because approximate nearest neighbor (ANN) vector search puts the “R” in RAG (retrieval-augmented generation). “Nearest neighbor” for text embedding models is almost always measured with angular distance — for instance, the cosine between two vectors.

  • Mar 17, 2024 | thenewstack.io | Jack Wallen |Gorkem Ercan |Manish Kapur |Jonathan Ellis

    Python is an object-oriented programming (OOP) language, which means it is centered around the idea of “objects” that can contain data (such as attributes) and code (such as methods). Because Python is an OOP language, it allows for the creation of classes. What are classes? I’m glad you asked. A class is a blueprint that is used to help organize and structure code for reusability and modularity. A good way to think of a class is that it is similar to a blueprint for a house.

  • Mar 16, 2024 | thenewstack.io | David Eastman |Gorkem Ercan |Manish Kapur |Jonathan Ellis

    An important career pivot for any developer — whether forced by layoffs or perhaps a desire to start something new — is moving from one industry sector to another. This might even involve a complete change — from gaming to medical instruments, banking to food manufacturing, etc. Regardless, to be attractive to a potential employer, you must not only bring fresh insights but also an encapsulated understanding of the sector you just left.

  • Mar 6, 2024 | thenewstack.io | Jonathan Ellis |Tim Davis |Yujian Tang |Gorkem Ercan

    Retrieval augmented generation (RAG) is by now a standard part of generative artificial intelligence (AI) applications. Supplementing your application prompt with relevant context retrieved from a vector database can dramatically increase accuracy and reduce hallucinations. This means that increasing relevance in vector search results has a direct correlation to the quality of your RAG application.

Contact details

Socials & Sites

Try JournoFinder For Free

Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.

Start Your 7-Day Free Trial →