Replies: 4 comments
Hey Hazel, thanks for the detailed feedback. Really appreciate you sharing all this context.
Regarding 2 and 3: there is an option to continue when processing a file fails: RAISE_INCREMENTAL_LOADING_ERRORS=False. Raising is on by default, but it can be turned off so that the run continues when an error happens. At the end, it returns the status of every file processed: which succeeded, which failed, and why.
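A minimal sketch of how that might look from Python. Two things here are assumptions, not confirmed in this thread: that the setting is read from the environment (it may equally belong in a .env file), and the shape of the returned status. The `results` list of dicts with `file`, `status`, and `error` keys is hypothetical and only illustrates the succeeded/failed/why summary.

```python
import os

# From the reply above: do not abort the whole run when one file fails.
# Assumption: the setting is picked up from the environment.
os.environ["RAISE_INCREMENTAL_LOADING_ERRORS"] = "False"


def summarize(results):
    """Split per-file results into succeeded and failed.

    `results` is a HYPOTHETICAL shape: a list of dicts with "file",
    "status" ("ok" or "failed"), and an optional "error" string.
    """
    succeeded = [r["file"] for r in results if r["status"] == "ok"]
    failed = {
        r["file"]: r.get("error", "unknown")
        for r in results
        if r["status"] != "ok"
    }
    return succeeded, failed


# Made-up example data, for illustration only.
example = [
    {"file": "a.md", "status": "ok"},
    {"file": "b.md", "status": "failed", "error": "parse error"},
]
succeeded, failed = summarize(example)
print(f"{len(succeeded)} succeeded, {len(failed)} failed")
for name, reason in failed.items():
    print(f"  {name}: {reason}")
```

The failed mapping is what you would feed into a retry pass once the underlying error is understood.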
Regarding 1: can you open new help issues with more details regarding each of the issues? It will help us keep track of them and resolve them.
Hi all,
I've been really excited about Cognee - it seems like the perfect knowledge graph architecture. I'm trying to use it to set up a 'coworker agent' Slackbot with memory pulled from Notion, Slack, and GitHub, but unfortunately hit several roadblocks in the cognify step. I'm using Temporal mode since I want to prioritize more recent sources, but saw similar issues in base cognify.
My stack is Postgres, pgvector, and Neo4j, currently just running locally on a MacBook with an M4 chip. I tried haiku-4-5 + OpenAI embedding large, and then switched to gemini-embedding-001 / gemini-flash-latest for faster performance.
I'm trying to load about 3000 files (32MB) into a single dataset in Cognee.
1. I saw various blocking errors during Cognify, including:
2. It seems like if a file fails to Cognify, it blocks all other files. So loading all my files at once, and then Cognifying at the end didn't work because Cognify would crash, and there wasn't an easy way to tell which file(s) had caused the crash. I switched to an approach of adding+cognifying 1 file at a time, skipping the file if Cognify crashed, but that was very slow.
3. It's pretty hard to understand and keep track of all of the errors + logs. It'd be really nice if there was a 'per source file' view where I could see overall: how many files succeeded/failed, and click into logs per file, and retry cognifying per file.
4. Cognify seemed unworkably slow for us. It seemed like it'd take 40+ hours to process our initial data.
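For reference, the one-file-at-a-time workaround described above can be sketched roughly as follows, with a per-file status map so failures are easy to find and retry. This is only an illustration: `process_one` is a hypothetical coroutine standing in for "add one file, then cognify it", and `fake_process` is a stand-in so the sketch runs on its own. Whether several files can safely be processed concurrently against the same dataset is an assumption; `max_concurrent=1` reproduces the sequential behaviour.

```python
import asyncio


async def process_all(paths, process_one, max_concurrent=4):
    """Run `process_one(path)` for every path, isolating failures per file.

    `process_one` is a HYPOTHETICAL coroutine function supplied by the
    caller. Returns a dict mapping each path to None on success, or to
    the exception that was raised on failure.
    """
    semaphore = asyncio.Semaphore(max_concurrent)
    status = {}

    async def run(path):
        async with semaphore:  # cap the number of files in flight
            try:
                await process_one(path)
                status[path] = None
            except Exception as exc:  # record the failure and keep going
                status[path] = exc

    await asyncio.gather(*(run(p) for p in paths))
    return status


async def fake_process(path):
    # Stand-in for the real work; fails on one made-up file name.
    await asyncio.sleep(0)
    if path == "bad.md":
        raise ValueError("could not parse")


status = asyncio.run(process_all(["a.md", "bad.md", "c.md"], fake_process))
failed = {p: repr(e) for p, e in status.items() if e is not None}
print(f"{len(status) - len(failed)} succeeded, {len(failed)} failed")
```

Retrying is then a matter of calling `process_all` again with only the keys of `failed`.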
Thanks and still really excited for Cognee's future!
This discussion was automatically pulled from Discord.