When following the quickstart instructions and trying to index a document after running ./run-with-docker-compose.sh:
docsgpt-worker-1 | [2023-10-01 09:36:43,450: INFO/MainProcess] Task application.api.user.tasks.ingest[1eabb828-445e-4bb7-828c-10125e77a741] received
docsgpt-worker-1 | [2023-10-01 09:36:43,451: WARNING/ForkPoolWorker-6] inputs/local/everything.zip
docsgpt-worker-1 | [2023-10-01 09:36:43,484: WARNING/ForkPoolWorker-6] <Response [200]>
docsgpt-worker-1 | [2023-10-01 09:36:51,673: WARNING/ForkPoolWorker-6] Grouping small documents
docsgpt-worker-1 | [2023-10-01 09:36:53,123: WARNING/ForkPoolWorker-6] Separating large documents
docsgpt-worker-1 | [2023-10-01 09:36:53,839: ERROR/ForkPoolWorker-6] Task application.api.user.tasks.ingest[1eabb828-445e-4bb7-828c-10125e77a741] raised unexpected: ValueError('Please provide either elasticsearch_url or cloud_id.')
docsgpt-worker-1 | Traceback (most recent call last):
docsgpt-worker-1 | File "/usr/local/lib/python3.10/site-packages/celery/app/trace.py", line 451, in trace_task
docsgpt-worker-1 | R = retval = fun(*args, **kwargs)
docsgpt-worker-1 | File "/usr/local/lib/python3.10/site-packages/celery/app/trace.py", line 734, in __protected_call__
docsgpt-worker-1 | return self.run(*args, **kwargs)
docsgpt-worker-1 | File "/app/application/api/user/tasks.py", line 6, in ingest
docsgpt-worker-1 | resp = ingest_worker(self, directory, formats, name_job, filename, user)
docsgpt-worker-1 | File "/app/application/worker.py", line 78, in ingest_worker
docsgpt-worker-1 | call_openai_api(docs, full_path, self)
docsgpt-worker-1 | File "/app/application/parser/open_ai_func.py", line 48, in call_openai_api
docsgpt-worker-1 | store = VectorCreator.create_vectorstore(
docsgpt-worker-1 | File "/app/application/vectorstore/vector_creator.py", line 16, in create_vectorstore
docsgpt-worker-1 | return vectorstore_class(*args, **kwargs)
docsgpt-worker-1 | File "/app/application/vectorstore/elasticsearch.py", line 35, in __init__
docsgpt-worker-1 | raise ValueError("Please provide either elasticsearch_url or cloud_id.")
docsgpt-worker-1 | ValueError: Please provide either elasticsearch_url or cloud_id.
docsgpt-backend-1 | [2023-10-01 09:36:58 +0000] [8] [ERROR] Error handling request /api/task_status?task_id=1eabb828-445e-4bb7-828c-10125e77a741
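The progress bar then stays stuck at 1% and nothing gets indexed.
Out of curiosity, I tried VECTOR_STORE=faiss in .env, but that didn't help. I went back a few commits, and c1c54f4 still works fine.
Other than that, very cool. I was experimenting with a CLI RAG application, and this makes it so much better!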
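For anyone trying to reproduce this: the traceback shows the Elasticsearch store's `__init__` being reached, so the worker apparently selected that store despite the `.env` change. Below is a minimal sketch, not DocsGPT's actual code, of a check that reports which store a process would pick from its environment and fails early if the Elasticsearch settings are missing. The function name `resolve_vector_store`, the variable names `ELASTIC_URL` and `ELASTIC_CLOUD_ID`, and the `faiss` default are all assumptions made for illustration; only `VECTOR_STORE` and the error message come from the report above.

```python
import os

# Hypothetical set of store names, for illustration only.
SUPPORTED_STORES = {"faiss", "elasticsearch"}


def resolve_vector_store(env=None):
    """Return the vector store name selected by the environment.

    Raises ValueError if the store is unknown, or if Elasticsearch is
    selected without a URL or cloud ID.
    """
    env = os.environ if env is None else env
    # Assumption: an unset VECTOR_STORE falls back to "faiss".
    store = env.get("VECTOR_STORE", "faiss").strip().lower()
    if store not in SUPPORTED_STORES:
        raise ValueError(f"Unsupported VECTOR_STORE: {store!r}")
    # ELASTIC_URL / ELASTIC_CLOUD_ID are assumed variable names.
    if store == "elasticsearch" and not (
        env.get("ELASTIC_URL") or env.get("ELASTIC_CLOUD_ID")
    ):
        raise ValueError("Please provide either elasticsearch_url or cloud_id.")
    return store
```

Running something like this inside the worker container (rather than on the host) would show whether the value from `.env` actually reaches the process that performs the ingest.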
1 answer
If no one has taken this on yet, I'd like to work on it.