Skip to main content

AwaDB

AwaDB是一个用于LLM应用程序中的嵌入向量的搜索和存储的AI本地数据库。

本笔记本展示了如何使用与AwaDB相关的功能。

pip install awadb
from langchain.text_splitter import CharacterTextSplitter
from langchain.vectorstores import AwaDB
from langchain.document_loaders import TextLoader
loader = TextLoader("../../../state_of_the_union.txt")
documents = loader.load()
text_splitter = CharacterTextSplitter(chunk_size=100, chunk_overlap=0)
docs = text_splitter.split_documents(documents)
db = AwaDB.from_documents(docs)
query = "What did the president say about Ketanji Brown Jackson"
docs = db.similarity_search(query)
print(docs[0].page_content)
    And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.

相似度搜索与分数

返回的距离分数介于0-1之间。0表示不相似,1表示最相似

docs = db.similarity_search_with_score(query)
print(docs[0])
    (Document(page_content='And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence.', metadata={'source': '../../../state_of_the_union.txt'}), 0.561813814013747)

恢复之前创建和添加的表格

AwaDB会自动持久化添加的文档数据

如果您可以恢复之前创建和添加的表格,只需按照以下方式操作:

awadb_client = awadb.Client()
ret = awadb_client.Load("langchain_awadb")
if ret:
print("awadb load table success")
else:
print("awadb load table failed")

awadb load table success