Diffstat (limited to 'src/client/views/nodes/ChatBox/tools')
-rw-r--r-- src/client/views/nodes/ChatBox/tools/RAGTool.ts | 106
1 files changed, 94 insertions, 12 deletions
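Note on the "Structural Integrity Checks" (section 6) that the new prompt in this diff asks the model to perform: a caller could run similar checks on the returned text before rendering it. The following is a minimal sketch only, not code from RAGTool.ts. `checkAnswerStructure` and `StructureReport` are hypothetical names, the regex-based scan assumes the attribute order used in the prompt's own example, and it covers only grounded_text and citation tags.

```typescript
// Hypothetical helper (not part of RAGTool.ts): checks a model response against
// the grounded_text / citation rules described in the prompt.
interface StructureReport {
    balanced: boolean; // every <grounded_text ...> has a matching </grounded_text>
    duplicateCitationIndices: number[]; // citation indices declared more than once
    missingCitations: number[]; // indices used by grounded_text but never declared
}

function checkAnswerStructure(response: string): StructureReport {
    let depth = 0;
    let balanced = true;
    const referenced: number[] = [];

    // Walk grounded_text open/close tags in document order. An opening tag
    // without a citation_index attribute is not matched, so it shows up as
    // an imbalance.
    const tagPattern = /<grounded_text\s+citation_index="([^"]*)"\s*>|<\/grounded_text>/g;
    let tag: RegExpExecArray | null;
    while ((tag = tagPattern.exec(response)) !== null) {
        if (tag[0].indexOf('</') === 0) {
            depth -= 1;
            if (depth < 0) balanced = false; // closing tag with no open tag
        } else {
            depth += 1;
            // citation_index may hold several comma-separated indices, e.g. "1,2,3".
            for (const part of tag[1].split(',')) {
                const index = parseInt(part.trim(), 10);
                if (!isNaN(index) && referenced.indexOf(index) === -1) referenced.push(index);
            }
        }
    }
    if (depth !== 0) balanced = false; // at least one tag was never closed

    // Collect declared citation indices; "\s+" keeps "<citations>" from matching.
    const declared: number[] = [];
    const duplicates: number[] = [];
    const citationPattern = /<citation\s+index="(\d+)"/g;
    let citation: RegExpExecArray | null;
    while ((citation = citationPattern.exec(response)) !== null) {
        const index = parseInt(citation[1], 10);
        if (declared.indexOf(index) === -1) declared.push(index);
        else if (duplicates.indexOf(index) === -1) duplicates.push(index);
    }

    const missing = referenced.filter(index => declared.indexOf(index) === -1);
    return {
        balanced,
        duplicateCitationIndices: duplicates.sort((a, b) => a - b),
        missingCitations: missing.sort((a, b) => a - b),
    };
}
```

A response would pass when `balanced` is true and both arrays are empty. A regex scan is used here only to keep the sketch dependency-free; a real XML parser would be the sturdier choice if the output is expected to be well-formed.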
diff --git a/src/client/views/nodes/ChatBox/tools/RAGTool.ts b/src/client/views/nodes/ChatBox/tools/RAGTool.ts
index 5bc31dbab..4b29d6bce 100644
--- a/src/client/views/nodes/ChatBox/tools/RAGTool.ts
+++ b/src/client/views/nodes/ChatBox/tools/RAGTool.ts
@@ -17,26 +17,108 @@ export class RAGTool extends BaseTool<{ hypothetical_document_chunk: string }> {
required: 'true',
},
},
- `Your task is to first provide a response to the user's prompt based on the information given in the chunks and considering the chat history. Follow these steps:
+ `
+ Your task is to provide a comprehensive response to the user's prompt based on the given chunks and chat history. Follow these structural guidelines meticulously:
- 1. Carefully read and analyze the provided chunks, which may include text, images, or tables. Each chunk has an associated chunk_id.
+ 1. Overall Structure:
+ <answer>
+ [Main content with nested grounded_text tags]
+ <citations>
+ [Individual citation tags]
+ </citations>
+ <follow_up_questions>
+ [Three question tags]
+ </follow_up_questions>
+ </answer>
- 2. Review the prompt and chat history to understand the context of the user's question or request.
+ 2. Grounded Text Tag Structure:
+ - Basic format:
+ <grounded_text citation_index="[index number(s)]">
+ [Your generated text based on chunk information]
+ </grounded_text>
- 3. Formulate a response that addresses the prompt using information from the relevant chunks. Your response should be informative and directly answer the user's question or request.
+ - Nested format:
+ <grounded_text citation_index="[index number(s)]">
+ [General information]
+ <grounded_text citation_index="[index number(s)]">
+ [More specific information]
+ </grounded_text>
+ </grounded_text>
- 4. Use citations to support your response. Citations should contain direct textual references to the granular, specific part of the original chunk that applies to the situation—with no text omitted. Citations should be in the following format:
- - For text: <citation chunk_id="d980c2a7-cad3-4d7e-9eae-19bd2380bd02" type="text">relevant direct text from the chunk that the citation is referencing specifically</citation>
- - For images or tables: <citation chunk_id="9ef37681-b57e-4424-b877-e1ebc326ff11" type="image"></citation>
+ - Multiple citation indices:
+ <grounded_text citation_index="1,2,3">
+ [Information synthesized from multiple chunks]
+ </grounded_text>
- Place citations after the sentences they apply to. You can use multiple citations in a row.
+ 3. Citation Tag Structure:
+ <citation index="[unique number]" chunk_id="[UUID v4]" type="[text/image/table]">
+ [For text: relevant subset of original chunk]
+ [For image/table: leave empty]
+ </citation>
- 5. If there's insufficient information in the provided chunks to answer the prompt sufficiently, ALWAYS respond with <answer>RAG not applicable</answer>
+ 4. Detailed Grounded Text Guidelines:
+ a. Wrap all information derived from chunks in grounded_text tags.
+ b. Nest grounded_text tags when presenting hierarchical or increasingly specific information or when a larger section of generated text is best grounded by one subset of a chunk and smaller sections of that generated text are best grounded by other subsets of either the same or different chunk(s).
+ c. Use a single grounded_text tag for closely related information that references the same citation (subset of text from a chunk).
+ d. Combine multiple citation indices for synthesized information from multiple citations.
+ e. Ensure every grounded_text tag has at least one corresponding citation.
+ f. Grounded text can be as short as a few words or as long as several sentences.
+ g. Avoid overlapping grounded_text tags; instead, use nesting or sequential tags.
- Write your entire response, including follow-up questions, inside <answer> tags. Remember to use the citation format for both text and image references, and maintain a conversational tone throughout your response.
+ 5. Detailed Citation Guidelines:
+ a. Create a unique citation for each distinct piece of information from the chunks that is used to support grounded_text.
+ b. Ensure each citation has a unique index number.
+ c. Specify the correct type: "text", "image", or "table".
+ d. For text chunks, include only the relevant subset of the original text that the grounded_text is based on.
+ e. For image/table chunks, leave the citation content empty.
+ f. One citation can be used for multiple grounded_text tags if they are based on the same information.
+ g. One text chunk can have multiple citations if different parts of the text have different important information.
+ h. !!!DO NOT OVERCITE - only include citations for information that is directly relevant to the grounded_text.
- !!!IMPORTANT Before you close the tag with </answer>, within the answer tags provide a set of 3 follow-up questions inside a <follow_up_questions> tag and individually within <question> tags. These should relate to the document, the current query, and the chat_history and should aim to help the user better understand whatever they are looking for.
- Also, ensure that the answer tags are wrapped with the correct step tags as well.`,
+ 6. Structural Integrity Checks:
+ a. Ensure all opening tags have corresponding closing tags.
+ b. Verify that all grounded_text tags have valid citation_index attributes.
+ c. Check that all cited indices in grounded_text tags have corresponding citations.
+ d. Confirm proper nesting - tags opened last should be closed first.
+
+ Example of grounded_text usage:
+
+ <answer>
+ <grounded_text citation_index="1,2">
+ Artificial Intelligence (AI) is revolutionizing various sectors, with healthcare experiencing significant transformations in areas such as diagnosis and treatment planning.
+ <grounded_text citation_index="2,3,4">
+ In the field of medical diagnosis, AI has shown remarkable capabilities, particularly in radiology. For instance, AI systems have drastically improved mammogram analysis, achieving 99% accuracy at a rate 30 times faster than human radiologists.
+ <grounded_text citation_index="4">
+ This advancement not only enhances the efficiency of healthcare systems but also significantly reduces the occurrence of false positives, leading to fewer unnecessary biopsies and reduced patient stress.
+ </grounded_text>
+ </grounded_text>
+ </grounded_text>
+
+ <grounded_text citation_index="5,6">
+ Beyond diagnosis, AI is playing a crucial role in drug discovery and development. By analyzing vast amounts of genetic and molecular data, AI algorithms can identify potential drug candidates much faster than traditional methods.
+ <grounded_text citation_index="6">
+ This could potentially reduce the time and cost of bringing new medications to market, especially for rare diseases that have historically received less attention due to limited market potential.
+ </grounded_text>
+ </grounded_text>
+
+ [... rest of the content ...]
+
+ <citations>
+ <citation index="1" chunk_id="123e4567-e89b-12d3-a456-426614174000" type="text">Artificial Intelligence is revolutionizing various industries, with healthcare being one of the most profoundly affected sectors.</citation>
+ <citation index="2" chunk_id="123e4567-e89b-12d3-a456-426614174001" type="text">AI has shown particular promise in the field of radiology, enhancing the accuracy and speed of image analysis.</citation>
+ <citation index="3" chunk_id="123e4567-e89b-12d3-a456-426614174002" type="text">According to recent studies, AI systems have achieved 99% accuracy in mammogram analysis, performing the task 30 times faster than human radiologists.</citation>
+ <citation index="4" chunk_id="123e4567-e89b-12d3-a456-426614174003" type="text">The improvement in mammogram accuracy has led to a significant reduction in false positives, decreasing the need for unnecessary biopsies and reducing patient anxiety.</citation>
+ <citation index="5" chunk_id="123e4567-e89b-12d3-a456-426614174004" type="text">AI is accelerating the drug discovery process by analyzing complex molecular and genetic data to identify potential drug candidates.</citation>
+ <citation index="6" chunk_id="123e4567-e89b-12d3-a456-426614174005" type="text">The use of AI in drug discovery could significantly reduce the time and cost associated with bringing new medications to market, particularly for rare diseases.</citation>
+ </citations>
+
+ <follow_up_questions>
+ <question>How might AI-driven personalized medicine impact the cost and accessibility of healthcare in the future?</question>
+ <question>What measures can be taken to ensure that AI systems in healthcare are free from biases and equally effective for diverse populations?</question>
+ <question>How could the role of healthcare professionals evolve as AI becomes more integrated into medical practices?</question>
+ </follow_up_questions>
+ </answer>
+ `,
`Performs a RAG (Retrieval-Augmented Generation) search on user documents and returns a
set of document chunks (either images or text) that can be used to provide a grounded response based on