One proposal for AI-powered document chat systems is to store a session's knowledge vectors in the browser, using IndexedDB, instead of relying on a server-side vector database.
The article examines the performance implications of server-side vector storage, which can introduce latency and scalability problems as a growing number of concurrent users compete for shared infrastructure.
The proposal relies on modern browser capabilities such as IndexedDB and argues that the main performance advantage is that each user's document processing is handled effectively in that user's own browser.
The article presents browser-based retrieval-augmented generation (RAG) as a compelling alternative for session-level document chat applications and demonstrates its feasibility with a working implementation called Smart Document Workspace.
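
To make the idea concrete, here is a minimal TypeScript sketch of how session vectors might be kept in IndexedDB and searched on the client. It is not taken from Smart Document Workspace. The database name `session-vectors`, the store name `chunks`, the `ChunkRecord` shape, and the brute-force cosine ranking are all assumptions made for illustration, and the embeddings are assumed to have been computed elsewhere.

```typescript
// Hypothetical record shape: one embedded chunk of an uploaded document.
interface ChunkRecord {
  id: string;
  text: string;
  vector: Float32Array; // typed arrays survive IndexedDB's structured clone
}

const DB_NAME = "session-vectors"; // assumed name, not from the article
const STORE = "chunks";            // assumed name, not from the article

// Open (and on first use create) the session database.
function openDb(): Promise<IDBDatabase> {
  return new Promise((resolve, reject) => {
    const req = indexedDB.open(DB_NAME, 1);
    req.onupgradeneeded = () => {
      req.result.createObjectStore(STORE, { keyPath: "id" });
    };
    req.onsuccess = () => resolve(req.result);
    req.onerror = () => reject(req.error);
  });
}

// Persist embedded chunks for the current session.
async function saveChunks(records: ChunkRecord[]): Promise<void> {
  const db = await openDb();
  return new Promise((resolve, reject) => {
    const tx = db.transaction(STORE, "readwrite");
    const store = tx.objectStore(STORE);
    for (const r of records) store.put(r);
    tx.oncomplete = () => { db.close(); resolve(); };
    tx.onerror = () => reject(tx.error);
  });
}

// Read every stored chunk back into memory.
async function loadChunks(): Promise<ChunkRecord[]> {
  const db = await openDb();
  return new Promise((resolve, reject) => {
    const req = db.transaction(STORE, "readonly").objectStore(STORE).getAll();
    req.onsuccess = () => { db.close(); resolve(req.result as ChunkRecord[]); };
    req.onerror = () => reject(req.error);
  });
}

// Cosine similarity of two equal-length vectors; 0 if either has zero norm.
function cosineSimilarity(a: ArrayLike<number>, b: ArrayLike<number>): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / Math.sqrt(normA * normB);
}

// Brute-force ranking: score every chunk against the query, keep the best k.
function topK(
  query: ArrayLike<number>,
  records: ChunkRecord[],
  k: number
): { record: ChunkRecord; score: number }[] {
  return records
    .map((record) => ({ record, score: cosineSimilarity(query, record.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}
```

In a browser, a chat turn under these assumptions would call `loadChunks()`, pass the result to `topK` together with the embedded question, and send only the selected chunk text to the language model. A linear scan is a plausible choice at session scale, where the corpus is a handful of documents, but it is a simplification chosen for this sketch and not a claim about how the article's implementation ranks chunks.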