Blog

2023

Decoding Similarity Search with FAISS: A Practical Approach

29 August 2023·1395 words·7 mins

In this blog post, you read about FAISS and your options for implementing similarity search using FAISS.

Searching through images using the CLIP model

12 July 2023·1084 words·6 mins

A walktrough for writing an application that is searching through images using the clip model. An explanation of the model, plus python code.

Vector search using Langchain, Weaviate and OpenSearch

30 May 2023·1176 words·6 mins

With the popularity of ChatGPT and Large Language Models (LLM), everybody is talking about it. On my Linkedin home page, about 90% of the posts seem to speak about ChatGPT, AI, and LLM. With my experience in search solutions and interest in everything related to Natural Language Processing and Search, I also had to start working on solutions.

Introduction of Frank and Amy at HaystackConf USA

2 May 2023·1311 words·7 mins

Last week, at the end of April 2023, I attended the HaystackConf in Charlottesville, USA. HaystackConf is the conference to participate in if you are a search relevance engineer. With over a hundred people attending on-site and more than a hundred online, this is a good representation of the almost 4000 members at the time of writing in the search relevance community on Slack.

Learn Elasticsearch from my liveProject at Manning

13 February 2023·704 words·4 mins

News Search

Learn about the creation process for a Manning liveProject. The liveProject I created is called Elasticsearch for a search API.

2021

Java with Lambda does not have to be slow

29 March 2021·923 words·5 mins

Cloud

As a seasoned Java programmer, usually with Spring Boot, it hurts to learn that most lambdas use JavaScript/TypeScript. What if you want to keep writing your Lambda in Java? A query in google teaches us that Quarkus is the way to go. This blog post introduces Quarkus, GraalVM, and we top it off with AWS CDK to deploy the sample.

AWS Lambda here, there and everywhere

25 March 2021·2115 words·10 mins

Cloud

Everybody is talking about serverless, and with serverless comes serverless functions. Small pieces of code (in theory) that receive an event (input) and return a message (output). They do not need a server. Therefore they are serverless, and they scale on demand. On the AWS platform, a serverless function is called a Lambda. In theory, you write the code, and AWS takes care of the rest. There is a reason why I put “in theory” in some locations in the text. Even writing the hello world sample does need some authorization configuration, and when running complete Docker images as a lambda, you cannot talk about small pieces of code. Time to focus on typical usage patterns for lambdas.

Deploying a secure AWS Elasticsearch cluster using cdk

3 March 2021·1824 words·9 mins

Cloud Search Security

If there is one thing that I have seen a lot with elasticsearch clusters in the wild, it is problems with security. The out of the box Elasticsearch installation does not have a user_id password protection like most databases do. Using the basic license these days does give you the option, but you still have to configure it yourself. When AWS introduced their AWS Elasticsearch service, it did not configure security out of the box. Therefore Elasticsearch clusters have been famous for exposed data. Check an article like this, but there are lots more. Does this mean Elasticsearch should not be used? No, but you must know what you are doing.

Let’s talk about the Elastic license change

12 February 2021·1850 words·9 mins

Is it a bomb under the open source model, or a genius move to protect their business from pirates? On January 14, 2021 a blog post was published on Elastic’s website, titled: licensing change. In this blog post, they announced a license change in the next release of all their tools (including Elasticsearch and Kibana). The purpose of this change is mainly to block AWS: Elastic wants to make it impossible for AWS to create its own competing managed Elasticsearch service. The announced changes were not very well received in the open source world. In this blog post I tried to give an overview of what happened, and what the consequences are for companies that use Elasticsearch and Kibana. Just a disclaimer, I am not a lawyer. Hence I will not go into the content of the new license model. The aim of this post is mainly to provide an overview of varying opinions on the internet and the possible consequences for users.

2019

Tip for Configuring Oh My Z Shell

4 January 2019·239 words·2 mins

Development

A quick tip on how to extend oh-my-zsh using custom shell scripts to switch JDKs or modify your PATH.

↑