Supporting blog content/llama index re ranker and elasticsearch re ranker #449
base: main
Conversation
Found 1 changed notebook. Review the changes at https://app.gitnotebooks.com/elastic/elasticsearch-labs/pull/449
I've added a few pieces of feedback. I'm keen to understand the in-memory vector store choice specifically.
"source": [
"# LlamaIndex re-ranker and Elasticsearch re-ranker: Comparison review\n",
"\n",
"This notebook demonstrates how to use AutoGen with Elasticsearch. This notebook is based on the article [LlamaIndex re-ranker and Elasticsearch re-ranker: Comparison review](https://www.elastic.co/search-labs/blog/llamaIndex-reranker-and-elasticsearch-reranker-comparison-review)."
Can you make sure the description matches the piece? This is using the old title which is misleading as discussed before.
Oh! I'm so sorry for that mistake, it was fixed in the most recent commit.
},
"outputs": [],
"source": [
"%pip install llama-index-core llama-index-llms-openai rank-llm llama-index-postprocessor-rankgpt-rerank llama-index-embeddings-openai"
Is this list of installs complete? If I remember correctly, getpass is available in the Python standard library, but I'm not sure that nest_asyncio is. Can you confirm?
Good catch: getpass is part of the Python standard library, but nest_asyncio is a third-party package on PyPI, so it does need to be added to the install list.
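For context, the problem nest_asyncio works around can be shown with the standard library alone. In a Jupyter kernel an event loop is already running, so a plain `asyncio.run()` inside it fails; the sketch below (stdlib only, no nest_asyncio needed to run it) reproduces that error by nesting the call inside a running loop:

```python
import asyncio


async def fetch():
    return 42


async def main():
    # Inside an already-running event loop, asyncio.run() refuses to
    # start a second loop and raises RuntimeError -- the situation
    # nest_asyncio patches around in notebooks.
    try:
        asyncio.run(fetch())
        return "ok"
    except RuntimeError as err:
        return type(err).__name__


print(asyncio.run(main()))  # RuntimeError
```

Applying `nest_asyncio.apply()` first makes the nested `asyncio.run()` call succeed, which is why the notebook imports it.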
},
"outputs": [],
"source": [
"%pip install llama-index-core llama-index-llms-openai rank-llm llama-index-postprocessor-rankgpt-rerank llama-index-embeddings-openai"
Does it make sense to use a requirements.txt instead?
I think that using a requirements.txt file adds unnecessary complexity to the notebook. The standard approach is to use the pip install command in individual notebooks.
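For reference, if a requirements file were preferred instead, the packages from the notebook's `%pip install` line would simply be listed one per line (versions omitted here; pinning them is optional):

```text
llama-index-core
llama-index-llms-openai
rank-llm
llama-index-postprocessor-rankgpt-rerank
llama-index-embeddings-openai
```

Either approach installs the same set; the in-notebook `%pip install` just keeps the notebook self-contained.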
"\n",
" document_objects.append(Document(text=text_content))\n",
"\n",
"index = VectorStoreIndex.from_documents(document_objects)"
Is there a particular reason that you're using an in-memory vector store at this point and not Elasticsearch? I found this surprising, especially as we are still making reference to comparing the journeys in the piece. It doesn't feel like a fair comparison to me. Should this example be changed to use Elasticsearch as the vector store?
If there's a reason why not and we agree, can you add a comment in this section that you are using the in-memory store at this point and not Elasticsearch? Until I got to that section of the article it wasn't clear to me.
The idea was to compare how easy it is to set up each of the two approaches for reranking, and to explain that the quickstart effort is similar, but with LlamaIndex you end up with a local-only implementation, while Elasticsearch unblocks a more robust solution. The focus was on reranking, not on the data store.
Another way to see it is to compare against an existing Elasticsearch implementation and include the data store in the comparison. In that case, what you say makes sense and we should use the Elasticsearch vector store.
After giving it more thought, I prefer the alternative your comments suggest: building the same implementation for both, rather than only comparing reranking, will be more valuable and straightforward to follow.
We will make the changes and let you know. Thanks for your feedback!
Hi again, @carlyrichmond. I made some changes to the notebook's source code. Now, both tests use the same Elasticsearch index to store and read the dataset.
Let me know if this makes sense to you.
"source": [
"## Cleaning environment\n",
"\n",
"Delete the resources created by this notebook so they don't keep consuming cluster resources."
Do we need to do any cleanup of resources for the RankGPT reranker stage?
No, it's not needed. RankGPTRerank is just an object instance, and there's no special resource allocation that requires cleanup.