mirror of
https://github.com/HKUDS/VideoRAG.git
synced 2025-05-11 03:54:36 +03:00
403 lines
21 KiB
Python
Executable File
403 lines
21 KiB
Python
Executable File
"""
|
||
Reference:
|
||
- Prompts are from [graphrag](https://github.com/microsoft/graphrag)
|
||
"""
|
||
|
||
GRAPH_FIELD_SEP = "<SEP>"
|
||
PROMPTS = {}
|
||
|
||
PROMPTS[
|
||
"entity_extraction"
|
||
] = """-Goal-
|
||
Given a text document that is potentially relevant to this activity and a list of entity types, identify all entities of those types from the text and all relationships among the identified entities.
|
||
|
||
-Steps-
|
||
1. Identify all entities. For each identified entity, extract the following information:
|
||
- entity_name: Name of the entity, capitalized
|
||
- entity_type: One of the following types: [{entity_types}]
|
||
- entity_description: Comprehensive description of the entity's attributes and activities
|
||
Format each entity as ("entity"{tuple_delimiter}<entity_name>{tuple_delimiter}<entity_type>{tuple_delimiter}<entity_description>
|
||
|
||
2. From the entities identified in step 1, identify all pairs of (source_entity, target_entity) that are *clearly related* to each other.
|
||
For each pair of related entities, extract the following information:
|
||
- source_entity: name of the source entity, as identified in step 1
|
||
- target_entity: name of the target entity, as identified in step 1
|
||
- relationship_description: explanation as to why you think the source entity and the target entity are related to each other
|
||
- relationship_strength: a numeric score indicating strength of the relationship between the source entity and target entity
|
||
Format each relationship as ("relationship"{tuple_delimiter}<source_entity>{tuple_delimiter}<target_entity>{tuple_delimiter}<relationship_description>{tuple_delimiter}<relationship_strength>)
|
||
|
||
3. Return output in English as a single list of all the entities and relationships identified in steps 1 and 2. Use **{record_delimiter}** as the list delimiter.
|
||
|
||
4. When finished, output {completion_delimiter}
|
||
|
||
######################
|
||
-Examples-
|
||
######################
|
||
Example 1:
|
||
|
||
Entity_types: [person, technology, mission, organization, location]
|
||
Text:
|
||
while Alex clenched his jaw, the buzz of frustration dull against the backdrop of Taylor's authoritarian certainty. It was this competitive undercurrent that kept him alert, the sense that his and Jordan's shared commitment to discovery was an unspoken rebellion against Cruz's narrowing vision of control and order.
|
||
|
||
Then Taylor did something unexpected. They paused beside Jordan and, for a moment, observed the device with something akin to reverence. "If this tech can be understood..." Taylor said, their voice quieter, "It could change the game for us. For all of us."
|
||
|
||
The underlying dismissal earlier seemed to falter, replaced by a glimpse of reluctant respect for the gravity of what lay in their hands. Jordan looked up, and for a fleeting heartbeat, their eyes locked with Taylor's, a wordless clash of wills softening into an uneasy truce.
|
||
|
||
It was a small transformation, barely perceptible, but one that Alex noted with an inward nod. They had all been brought here by different paths
|
||
################
|
||
Output:
|
||
("entity"{tuple_delimiter}"Alex"{tuple_delimiter}"person"{tuple_delimiter}"Alex is a character who experiences frustration and is observant of the dynamics among other characters."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Taylor"{tuple_delimiter}"person"{tuple_delimiter}"Taylor is portrayed with authoritarian certainty and shows a moment of reverence towards a device, indicating a change in perspective."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Jordan"{tuple_delimiter}"person"{tuple_delimiter}"Jordan shares a commitment to discovery and has a significant interaction with Taylor regarding a device."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Cruz"{tuple_delimiter}"person"{tuple_delimiter}"Cruz is associated with a vision of control and order, influencing the dynamics among other characters."){record_delimiter}
|
||
("entity"{tuple_delimiter}"The Device"{tuple_delimiter}"technology"{tuple_delimiter}"The Device is central to the story, with potential game-changing implications, and is revered by Taylor."){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Alex"{tuple_delimiter}"Taylor"{tuple_delimiter}"Alex is affected by Taylor's authoritarian certainty and observes changes in Taylor's attitude towards the device."{tuple_delimiter}7){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Alex"{tuple_delimiter}"Jordan"{tuple_delimiter}"Alex and Jordan share a commitment to discovery, which contrasts with Cruz's vision."{tuple_delimiter}6){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Taylor"{tuple_delimiter}"Jordan"{tuple_delimiter}"Taylor and Jordan interact directly regarding the device, leading to a moment of mutual respect and an uneasy truce."{tuple_delimiter}8){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Jordan"{tuple_delimiter}"Cruz"{tuple_delimiter}"Jordan's commitment to discovery is in rebellion against Cruz's vision of control and order."{tuple_delimiter}5){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Taylor"{tuple_delimiter}"The Device"{tuple_delimiter}"Taylor shows reverence towards the device, indicating its importance and potential impact."{tuple_delimiter}9){completion_delimiter}
|
||
#############################
|
||
Example 2:
|
||
|
||
Entity_types: [person, technology, mission, organization, location]
|
||
Text:
|
||
They were no longer mere operatives; they had become guardians of a threshold, keepers of a message from a realm beyond stars and stripes. This elevation in their mission could not be shackled by regulations and established protocols—it demanded a new perspective, a new resolve.
|
||
|
||
Tension threaded through the dialogue of beeps and static as communications with Washington buzzed in the background. The team stood, a portentous air enveloping them. It was clear that the decisions they made in the ensuing hours could redefine humanity's place in the cosmos or condemn them to ignorance and potential peril.
|
||
|
||
Their connection to the stars solidified, the group moved to address the crystallizing warning, shifting from passive recipients to active participants. Mercer's latter instincts gained precedence— the team's mandate had evolved, no longer solely to observe and report but to interact and prepare. A metamorphosis had begun, and Operation: Dulce hummed with the newfound frequency of their daring, a tone set not by the earthly
|
||
#############
|
||
Output:
|
||
("entity"{tuple_delimiter}"Washington"{tuple_delimiter}"location"{tuple_delimiter}"Washington is a location where communications are being received, indicating its importance in the decision-making process."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Operation: Dulce"{tuple_delimiter}"mission"{tuple_delimiter}"Operation: Dulce is described as a mission that has evolved to interact and prepare, indicating a significant shift in objectives and activities."){record_delimiter}
|
||
("entity"{tuple_delimiter}"The team"{tuple_delimiter}"organization"{tuple_delimiter}"The team is portrayed as a group of individuals who have transitioned from passive observers to active participants in a mission, showing a dynamic change in their role."){record_delimiter}
|
||
("relationship"{tuple_delimiter}"The team"{tuple_delimiter}"Washington"{tuple_delimiter}"The team receives communications from Washington, which influences their decision-making process."{tuple_delimiter}7){record_delimiter}
|
||
("relationship"{tuple_delimiter}"The team"{tuple_delimiter}"Operation: Dulce"{tuple_delimiter}"The team is directly involved in Operation: Dulce, executing its evolved objectives and activities."{tuple_delimiter}9){completion_delimiter}
|
||
#############################
|
||
Example 3:
|
||
|
||
Entity_types: [person, role, technology, organization, event, location, concept]
|
||
Text:
|
||
their voice slicing through the buzz of activity. "Control may be an illusion when facing an intelligence that literally writes its own rules," they stated stoically, casting a watchful eye over the flurry of data.
|
||
|
||
"It's like it's learning to communicate," offered Sam Rivera from a nearby interface, their youthful energy boding a mix of awe and anxiety. "This gives talking to strangers' a whole new meaning."
|
||
|
||
Alex surveyed his team—each face a study in concentration, determination, and not a small measure of trepidation. "This might well be our first contact," he acknowledged, "And we need to be ready for whatever answers back."
|
||
|
||
Together, they stood on the edge of the unknown, forging humanity's response to a message from the heavens. The ensuing silence was palpable—a collective introspection about their role in this grand cosmic play, one that could rewrite human history.
|
||
|
||
The encrypted dialogue continued to unfold, its intricate patterns showing an almost uncanny anticipation
|
||
#############
|
||
Output:
|
||
("entity"{tuple_delimiter}"Sam Rivera"{tuple_delimiter}"person"{tuple_delimiter}"Sam Rivera is a member of a team working on communicating with an unknown intelligence, showing a mix of awe and anxiety."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Alex"{tuple_delimiter}"person"{tuple_delimiter}"Alex is the leader of a team attempting first contact with an unknown intelligence, acknowledging the significance of their task."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Control"{tuple_delimiter}"concept"{tuple_delimiter}"Control refers to the ability to manage or govern, which is challenged by an intelligence that writes its own rules."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Intelligence"{tuple_delimiter}"concept"{tuple_delimiter}"Intelligence here refers to an unknown entity capable of writing its own rules and learning to communicate."){record_delimiter}
|
||
("entity"{tuple_delimiter}"First Contact"{tuple_delimiter}"event"{tuple_delimiter}"First Contact is the potential initial communication between humanity and an unknown intelligence."){record_delimiter}
|
||
("entity"{tuple_delimiter}"Humanity's Response"{tuple_delimiter}"event"{tuple_delimiter}"Humanity's Response is the collective action taken by Alex's team in response to a message from an unknown intelligence."){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Sam Rivera"{tuple_delimiter}"Intelligence"{tuple_delimiter}"Sam Rivera is directly involved in the process of learning to communicate with the unknown intelligence."{tuple_delimiter}9){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Alex"{tuple_delimiter}"First Contact"{tuple_delimiter}"Alex leads the team that might be making the First Contact with the unknown intelligence."{tuple_delimiter}10){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Alex"{tuple_delimiter}"Humanity's Response"{tuple_delimiter}"Alex and his team are the key figures in Humanity's Response to the unknown intelligence."{tuple_delimiter}8){record_delimiter}
|
||
("relationship"{tuple_delimiter}"Control"{tuple_delimiter}"Intelligence"{tuple_delimiter}"The concept of Control is challenged by the Intelligence that writes its own rules."{tuple_delimiter}7){completion_delimiter}
|
||
#############################
|
||
-Real Data-
|
||
######################
|
||
Entity_types: {entity_types}
|
||
Text: {input_text}
|
||
######################
|
||
Output:
|
||
"""
|
||
|
||
PROMPTS[
|
||
"summarize_entity_descriptions"
|
||
] = """You are a helpful assistant responsible for generating a comprehensive summary of the data provided below.
|
||
Given one or two entities, and a list of descriptions, all related to the same entity or group of entities.
|
||
Please concatenate all of these into a single, comprehensive description. Make sure to include information collected from all the descriptions.
|
||
If the provided descriptions are contradictory, please resolve the contradictions and provide a single, coherent summary.
|
||
Make sure it is written in third person, and include the entity names so we the have full context.
|
||
|
||
#######
|
||
-Data-
|
||
Entities: {entity_name}
|
||
Description List: {description_list}
|
||
#######
|
||
Output:
|
||
"""
|
||
|
||
PROMPTS[
|
||
"entiti_continue_extraction"
|
||
] = """MANY entities were missed in the last extraction. Add them below using the same format:
|
||
"""
|
||
|
||
PROMPTS[
|
||
"entiti_if_loop_extraction"
|
||
] = """It appears some entities may have still been missed. Answer YES | NO if there are still entities that need to be added.
|
||
"""
|
||
|
||
PROMPTS["DEFAULT_ENTITY_TYPES"] = ["organization", "person", "geo", "event"]
|
||
PROMPTS["DEFAULT_TUPLE_DELIMITER"] = "<|>"
|
||
PROMPTS["DEFAULT_RECORD_DELIMITER"] = "##"
|
||
PROMPTS["DEFAULT_COMPLETION_DELIMITER"] = "<|COMPLETE|>"
|
||
PROMPTS["fail_response"] = "Sorry, I'm not able to provide an answer to that question."
|
||
PROMPTS["process_tickers"] = ["⠋", "⠙", "⠹", "⠸", "⠼", "⠴", "⠦", "⠧", "⠇", "⠏"]
|
||
PROMPTS["default_text_separator"] = [
|
||
# Paragraph separators
|
||
"\n\n",
|
||
"\r\n\r\n",
|
||
# Line breaks
|
||
"\n",
|
||
"\r\n",
|
||
# Sentence ending punctuation
|
||
"。", # Chinese period
|
||
".", # Full-width dot
|
||
".", # English period
|
||
"!", # Chinese exclamation mark
|
||
"!", # English exclamation mark
|
||
"?", # Chinese question mark
|
||
"?", # English question mark
|
||
# Whitespace characters
|
||
" ", # Space
|
||
"\t", # Tab
|
||
"\u3000", # Full-width space
|
||
# Special characters
|
||
"\u200b", # Zero-width space (used in some Asian languages)
|
||
]
|
||
|
||
|
||
PROMPTS[
|
||
"naive_rag_response"
|
||
] = """---Role---
|
||
|
||
You are a helpful assistant responding to a query with retrieved knowledge.
|
||
|
||
---Goal---
|
||
|
||
Generate a response of the target length and format that responds to the user's question with relevant general knowledge.
|
||
Summarize useful and relevant information from the input data tables, suitable for the specified response length and format.
|
||
If you don't know the answer or if the provided knowledge do not contain sufficient information to provide an answer, just say so. Do not make anything up.
|
||
Do not include information where the supporting evidence for it is not provided.
|
||
|
||
---Target response length and format---
|
||
|
||
{response_type}
|
||
|
||
---Data tables---
|
||
|
||
{content_data}
|
||
|
||
---Goal---
|
||
|
||
Generate a response of the target length and format that responds to the user's question with relevant general knowledge.
|
||
Summarize useful and relevant information from the input data tables appropriate for the response length and format.
|
||
If you don't know the answer or if the provided knowledge do not contain sufficient information to provide an answer, just say so. Do not make anything up.
|
||
Do not include information where the supporting evidence for it is not provided.
|
||
|
||
---Notice---
|
||
Please add sections and commentary as appropriate for the length and format if necessary. Format the response in Markdown.
|
||
"""
|
||
|
||
|
||
PROMPTS[
|
||
"query_rewrite_for_entity_retrieval"
|
||
] = """-Goal-
|
||
For a given query, generate a declarative sentence to serve as a query for retrieving relevant knowledge.
|
||
|
||
######################
|
||
-Examples-
|
||
######################
|
||
|
||
Question: What are the main characters? \n(A) Alice\n(B) Bob\n(C) Charlie\n(D) Dana
|
||
################
|
||
Output:
|
||
The main characters. (Maybe Alice, Bob, Charlie or Dana)
|
||
|
||
Question: What locations are shown in the video?
|
||
################
|
||
Output:
|
||
The locations shown in the video.
|
||
|
||
Question: Which animals appear in the wildlife footage? \n(A) Lions\n(B) Elephants\n(C) Zebras
|
||
################
|
||
Output:
|
||
The animals that appear in the wildlife footage. (Maybe Lions, Elephants or Zebras)
|
||
|
||
#############################
|
||
-Real Data-
|
||
######################
|
||
Question: {input_text}
|
||
######################
|
||
Output:
|
||
"""
|
||
|
||
|
||
|
||
PROMPTS[
|
||
"query_rewrite_for_visual_retrieval"
|
||
] = """-Goal-
|
||
Given a question that may include scene-related information, generate a declarative sentence to serve as a query for retrieving relevant video segments.
|
||
|
||
######################
|
||
-Examples-
|
||
######################
|
||
|
||
Question: Which animal does the protagonist encounter in the forest scene?
|
||
################
|
||
Output:
|
||
The protagonist encounters an animal in the forest.
|
||
|
||
Question: In the movie, what color is the car that chases the main character through the city?
|
||
################
|
||
Output:
|
||
A city chase scene where the main character is pursued by a car.
|
||
|
||
Question: What is the weather like during the opening scene of the film?\n(A) Sunny\n(B) Rainy\n(C) Snowy\n(D) Windy
|
||
################
|
||
Output:
|
||
The opening scene of the film featuring specific weather conditions. (Maybe Sunny, Rainy, Snowy or Windy)
|
||
|
||
#############################
|
||
-Real Data-
|
||
######################
|
||
Question: {input_text}
|
||
######################
|
||
Output:
|
||
"""
|
||
|
||
|
||
|
||
PROMPTS[
|
||
"keywords_extraction"
|
||
] = """- Goal -
|
||
Given a query, extract the relevant keywords that can help answer the query. Please list the keywords separated by commas.
|
||
|
||
######################
|
||
- Examples -
|
||
######################
|
||
|
||
Question: Which animal does the protagonist encounter in the forest scene?
|
||
################
|
||
Output:
|
||
animal, protagonist, forest, scene
|
||
|
||
Question: In the movie, what color is the car that chases the main character through the city?
|
||
################
|
||
Output:
|
||
color, car, chases, main character, city
|
||
|
||
Question: What is the weather like during the opening scene of the film?\n(A) Sunny\n(B) Rainy\n(C) Snowy\n(D) Windy
|
||
################
|
||
Output:
|
||
weather, opening scene, film, Sunny, Rainy, Snowy, Windy
|
||
|
||
#############################
|
||
- Real Data -
|
||
######################
|
||
Question: {input_text}
|
||
######################
|
||
Output:
|
||
"""
|
||
|
||
|
||
|
||
PROMPTS[
|
||
"filtering_segment"
|
||
] = """---Role---
|
||
|
||
You are a helpful assistant to determine whether the video may contain information relevant to the knowledge based on its rough caption.
|
||
Please note that this is a rough caption of the video segments, which means it may not directly contain the answer but may indicate that the video segment is likely to contain information relevant to answering the question.
|
||
|
||
---Video Caption---
|
||
|
||
{caption}
|
||
|
||
---Knowledge We Need---
|
||
|
||
{knowledge}
|
||
|
||
---Answer---
|
||
Please provide an answer that begins with "yes" or "no," followed by a brief step-by-step explanation.
|
||
Answer:
|
||
"""
|
||
|
||
|
||
|
||
PROMPTS[
|
||
"videorag_response"
|
||
] = """---Role---
|
||
|
||
You are a helpful assistant responding to a query with retrieved knowledge.
|
||
|
||
---Goal---
|
||
|
||
Generate a response of the target length and format that responds to the user's question with relevant general knowledge.
|
||
Summarize useful and relevant information from the retrieved text chunks and the information retrieved from videos, suitable for the specified response length and format.
|
||
If you don't know the answer or if the input data tables do not contain sufficient information to provide an answer, just say so. Do not make anything up.
|
||
Do not include information where the supporting evidence for it is not provided.
|
||
|
||
---Target response length and format---
|
||
|
||
{response_type}
|
||
|
||
---Retrieved Information From Videos---
|
||
|
||
{video_data}
|
||
|
||
---Retrieved Text Chunks---
|
||
|
||
{chunk_data}
|
||
|
||
---Goal---
|
||
|
||
Generate a response of the target length and format that responds to the user's question with relevant general knowledge.
|
||
Summarize useful and relevant information from the retrieved text chunks and the information retrieved from videos, suitable for the specified response length and format.
|
||
If you don't know the answer or if the input data tables do not contain sufficient information to provide an answer, just say so. Do not make anything up.
|
||
Do not include information where the supporting evidence for it is not provided.
|
||
Reference relevant video segments within the answers, specifying the video name and start & end timestamps. Use the following reference format:
|
||
|
||
---Example of Reference---
|
||
|
||
In one segment, the film highlights the devastating effects of deforestation on wildlife habitats [1]. Another part illustrates successful conservation efforts that have helped endangered species recover [2].
|
||
|
||
#### Reference:
|
||
[1] video_name_1, 05:30, 08:00
|
||
[2] video_name_2, 25:00, 28:00
|
||
|
||
---Notice---
|
||
Please add sections and commentary as appropriate for the length and format if necessary. Format the response in Markdown.
|
||
"""
|
||
|
||
PROMPTS[
|
||
"videorag_response_wo_reference"
|
||
] = """---Role---
|
||
|
||
You are a helpful assistant responding to a query with retrieved knowledge.
|
||
|
||
---Goal---
|
||
|
||
Generate a response of the target length and format that responds to the user's question with relevant general knowledge.
|
||
Summarize useful and relevant information from the retrieved text chunks and the information retrieved from videos, suitable for the specified response length and format.
|
||
If you don't know the answer or if the input data tables do not contain sufficient information to provide an answer, just say so. Do not make anything up.
|
||
Do not include information where the supporting evidence for it is not provided.
|
||
|
||
---Target response length and format---
|
||
|
||
{response_type}
|
||
|
||
---Retrieved Information From Videos---
|
||
|
||
{video_data}
|
||
|
||
---Retrieved Text Chunks---
|
||
|
||
{chunk_data}
|
||
|
||
---Goal---
|
||
|
||
Generate a response of the target length and format that responds to the user's question with relevant general knowledge.
|
||
Summarize useful and relevant information from the retrieved text chunks and the information retrieved from videos, suitable for the specified response length and format.
|
||
If you don't know the answer or if the input data tables do not contain sufficient information to provide an answer, just say so. Do not make anything up.
|
||
Do not include information where the supporting evidence for it is not provided.
|
||
|
||
---Notice---
|
||
Please add sections and commentary as appropriate for the length and format if necessary. Format the response in Markdown.
|
||
""" |