Commits · 5ed04065caa74bb124ead268486b5ff8ed9d7220 · mirrored_repos / MachineLearning / run-llama / Llama Index

This project is mirrored from https://github.com/run-llama/llama_index.

Feb 17, 2024
- fix upgrade script (bm25 nits) (#10624) · 62fa8e2c
  Jerry Liu authored 1 year ago
- remove metadata extractors from platform (#10809) · 51e691bd
  Sourabh Desai authored 1 year ago
  
  remove metadata extractors from platform
Feb 16, 2024
- wip improved object retrieval (#10513) · 35464903
  Logan authored 1 year ago
- Logan/merge next (#10676) · a50303f7
  Logan authored 1 year ago
- [version bump] v0.10.5 (#10770) · edd941dd
  Logan authored 1 year ago
  
  Tag: v0.10.5
Feb 15, 2024
- ClickHouse as a vector store (#10583) · 9cc433aa
  Dale McDiarmid authored 1 year ago
- Fix the perf issue in building nodes from splits. (#10766) · f2d9472e
  preemoDez authored 1 year ago
  
  * Fix the perf issue in building nodes from splits.
  
  Create the `relationships` object only once. Otherwise, the whole text's hash is recomputed for every node, which is very inefficient for long text.
  
  An alternative approach would be to cache the hash property. However, that wasn't so straightforward, as `Document` isn't a cacheable type. I also do not know Python very well; maybe it would be enough to store a simple null and skip the recomputation when it isn't null? The most important reason, though, is that I'm not sure about the side effects and the existing assumption that the node is mutable and the hash always reflects the state during the call (unless we modify the object in multiple threads). This change doesn't break any assumptions. If the document was modified while we were creating nodes extracted from it, something would be very wrong.
  
  Benchmarks taken on a document attached to the bug:
  
  Before: Execution time for build_nodes_from_splits: 53.69 seconds
  After: Execution time for build_nodes_from_splits: 0.18 seconds
  
  * Fix the formatting
- Added Embeddings Option on custom Triplets (#10629) · c18b3234
  abhiram1809 authored 1 year ago
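
The idea behind the perf fix in #10766 (build the `relationships` object once instead of once per node) can be illustrated with a toy sketch. This is not the llama_index code: `ToyDocument`, `as_related_info`, `build_nodes_slow`, and `build_nodes_fast` are hypothetical names invented for this example, and the nodes are plain dicts. It only assumes, as the commit message states, that building the relationship reference hashes the whole document text.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class ToyDocument:
    """Hypothetical stand-in for a document whose hash is recomputed on every access."""

    text: str
    hash_calls: int = field(default=0, compare=False)  # counts how often we hash

    @property
    def hash(self) -> str:
        # Not cached: every access hashes the full text again.
        self.hash_calls += 1
        return hashlib.sha256(self.text.encode("utf-8")).hexdigest()

    def as_related_info(self) -> dict:
        # Building the reference reads the hash, so it costs one full-text hash.
        return {"hash": self.hash}


def build_nodes_slow(doc: ToyDocument, splits: list) -> list:
    # Rebuilds the relationships dict (and rehashes the text) for every split.
    return [
        {"text": s, "relationships": {"source": doc.as_related_info()}}
        for s in splits
    ]


def build_nodes_fast(doc: ToyDocument, splits: list) -> list:
    # Builds the relationships dict once, then gives each node a shallow copy.
    relationships = {"source": doc.as_related_info()}
    return [{"text": s, "relationships": dict(relationships)} for s in splits]


splits = ["chunk"] * 1000
slow_doc = ToyDocument("x" * 10_000)
fast_doc = ToyDocument("x" * 10_000)
slow_nodes = build_nodes_slow(slow_doc, splits)
fast_nodes = build_nodes_fast(fast_doc, splits)
print(slow_doc.hash_calls, fast_doc.hash_calls)  # prints "1000 1"
```

Both functions produce equal nodes; only the number of full-text hashes differs. Hoisting the computation out of the loop leaves the hash property itself uncached, so the assumption that the hash reflects the document's current state is untouched. This matches the trade-off the commit message describes.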

Feb 14, 2024
- [BUG-FIX] retriever_mode param missing when constructing KGTableRetriever (#10725) · 809dc3b1
  Andrei Fajardo authored 1 year ago
  
  add retriever_mode param when constructing KGTableRetriever
- Fix: The _indices of KnowledgeGraphIndex method does not update storage context index (#10687) · fc51b9fc
  Rana Banerjee authored 1 year ago
- corrected import (#10704) · b6e33d45
  Anoop Sharma authored 1 year ago
Feb 13, 2024
- Add FunctionComponent alias (#10673) · bf0b0c57
  Andrei Fajardo authored 1 year ago
  
  add FunctionComponent alias
- v0.10.3 version bump (#10662) · 5c1f0562
  Logan authored 1 year ago
  
  Tag: v0.10.3
- fixes #10654 (#10657) · a61a6a5d
  Kenan Deniz authored 1 year ago
- fix simple directory reader (#10655) · f7fba1cf
  Logan authored 1 year ago
- fix: add back resolve_embed_model (#10619) · c7e618ca
  Jerry Liu authored 1 year ago
  
  cr
- fix as_chat_engine specifying the LLM (#10605) · bf6ad5f1
  Logan authored 1 year ago
- fix base query pipeline notebook (#10609) · a3f7f073
  Jerry Liu authored 1 year ago
- fix query pipeline agent (#10608) · 6e0cf183
  Jerry Liu authored 1 year ago
Feb 12, 2024
- Dedup logic for recursive retriever nodes (#10597) · debaaa91
  Haotian Zhang authored 1 year ago
- v0.10.0 (#10537) · 369973f4
  Andrei Fajardo authored 1 year ago