Session

A Session is a conversation instance between the user and UFO. It is a continuous interaction that starts when the user initiates a request and ends when the request is completed. UFO supports multiple requests within the same session. Each request is processed sequentially, by a Round of interaction, until the user's request is fulfilled. We show the relationship between Session and Round in the following figure:

Session Lifecycle

The lifecycle of a Session is as follows:

1. Session Initialization

A Session is initialized when the user starts a conversation with UFO. The Session object is created, and the first Round of interaction is initiated. At this stage, the user's request is processed by the HostAgent to determine the appropriate application to fulfill the request. The Context object is created to store the state of the conversation shared across all Rounds within the Session.

2. Session Processing

Once the Session is initialized, the Round of interaction begins, which completes a single user request by orchestrating the HostAgent and AppAgent.

3. Next Round

After the completion of the first Round, the Session requests the next request from the user to start the next Round of interaction. This process continues until there are no more requests from the user. The core logic of a Session is shown below:

def run(self) -> None:
    """
    Run the session.
    """

    while not self.is_finished():

        round = self.create_new_round()
        if round is None:
            break
        round.run()

    if self.application_window is not None:
        self.capture_last_snapshot()

    if self._should_evaluate and not self.is_error():
        self.evaluation()

    self.print_cost()

4. Session Termination

If the user has no more requests or decides to end the conversation, the Session is terminated, and the conversation ends. The EvaluationAgent evaluates the completeness of the Session if it is configured to do so.

Reference

Bases: ABC

A basic session in UFO. A session consists of multiple rounds of interactions and conversations.

Initialize a session.

Parameters:	`task` (`str`) – The name of current task. `should_evaluate` (`bool`) – Whether to evaluate the session. `id` (`int`) – The id of the session.

Source code in module/basic.py

def __init__(self, task: str, should_evaluate: bool, id: int) -> None:
    """
    Initialize a session.
    :param task: The name of current task.
    :param should_evaluate: Whether to evaluate the session.
    :param id: The id of the session.
    """

    self._should_evaluate = should_evaluate
    self._id = id

    # Logging-related properties
    self.log_path = f"logs/{task}/"
    utils.create_folder(self.log_path)

    self._rounds: Dict[int, BaseRound] = {}

    self._context = Context()
    self._init_context()
    self._finish = False

    self._host_agent: HostAgent = AgentFactory.create_agent(
        "host",
        "HostAgent",
        configs["HOST_AGENT"]["VISUAL_MODE"],
        configs["HOSTAGENT_PROMPT"],
        configs["HOSTAGENT_EXAMPLE_PROMPT"],
        configs["API_PROMPT"],
        configs["ALLOW_OPENAPP"],
    )

`application_window: UIAWrapper` `property` `writable`

Get the application of the session. return: The application of the session.

`context: Context` `property`

Get the context of the session. return: The context of the session.

`cost: float` `property` `writable`

Get the cost of the session. return: The cost of the session.

`current_round: BaseRound` `property`

Get the current round of the session. return: The current round of the session.

`evaluation_logger: logging.Logger` `property`

Get the logger for evaluation. return: The logger for evaluation.

`id: int` `property`

Get the id of the session. return: The id of the session.

`rounds: Dict[int, BaseRound]` `property`

Get the rounds of the session. return: The rounds of the session.

`session_type: str` `property`

Get the class name of the session. return: The class name of the session.

`step: int` `property`

Get the step of the session. return: The step of the session.

`total_rounds: int` `property`

Get the total number of rounds in the session. return: The total number of rounds in the session.

`add_round(id, round)`

Add a round to the session.

Parameters:	`id` (`int`) – The id of the round. `round` (`BaseRound`) – The round to be added.

Source code in module/basic.py

def add_round(self, id: int, round: BaseRound) -> None:
    """
    Add a round to the session.
    :param id: The id of the round.
    :param round: The round to be added.
    """
    self._rounds[id] = round

`capture_last_snapshot()`

Capture the last snapshot of the application, including the screenshot and the XML file if configured.

Source code in module/basic.py

def capture_last_snapshot(self) -> None:
    """
    Capture the last snapshot of the application, including the screenshot and the XML file if configured.
    """

    # Capture the final screenshot
    screenshot_save_path = self.log_path + f"action_step_final.png"

    if self.application_window is not None:

        try:
            PhotographerFacade().capture_app_window_screenshot(
                self.application_window, save_path=screenshot_save_path
            )

        except Exception as e:
            utils.print_with_color(
                f"Warning: The last snapshot capture failed, due to the error: {e}",
                "yellow",
            )

        # Save the final XML file
        if configs["LOG_XML"]:
            log_abs_path = os.path.abspath(self.log_path)
            xml_save_path = os.path.join(log_abs_path, f"xml/action_step_final.xml")

            app_agent = self._host_agent.get_active_appagent()
            if app_agent is not None:
                app_agent.Puppeteer.save_to_xml(xml_save_path)

`create_following_round()`

Create a following round. return: The following round.

Source code in module/basic.py

def create_following_round(self) -> BaseRound:
    """
    Create a following round.
    return: The following round.
    """
    pass

`create_new_round()` `abstractmethod`

Create a new round.

Source code in module/basic.py

@abstractmethod
def create_new_round(self) -> Optional[BaseRound]:
    """
    Create a new round.
    """
    pass

`evaluation()`

Evaluate the session.

Source code in module/basic.py

def evaluation(self) -> None:
    """
    Evaluate the session.
    """
    utils.print_with_color("Evaluating the session...", "yellow")
    evaluator = EvaluationAgent(
        name="eva_agent",
        app_root_name=self.context.get(ContextNames.APPLICATION_ROOT_NAME),
        is_visual=configs["APP_AGENT"]["VISUAL_MODE"],
        main_prompt=configs["EVALUATION_PROMPT"],
        example_prompt="",
        api_prompt=configs["API_PROMPT"],
    )

    requests = self.request_to_evaluate()
    result, cost = evaluator.evaluate(request=requests, log_path=self.log_path)

    # Add additional information to the evaluation result.
    additional_info = {"level": "session", "request": requests, "id": 0}
    result.update(additional_info)

    self.cost += cost

    evaluator.print_response(result)

    self.evaluation_logger.info(json.dumps(result))

`experience_saver()`

Save the current trajectory as agent experience.

Source code in module/basic.py

def experience_saver(self) -> None:
    """
    Save the current trajectory as agent experience.
    """
    utils.print_with_color(
        "Summarizing and saving the execution flow as experience...", "yellow"
    )

    summarizer = ExperienceSummarizer(
        configs["APP_AGENT"]["VISUAL_MODE"],
        configs["EXPERIENCE_PROMPT"],
        configs["APPAGENT_EXAMPLE_PROMPT"],
        configs["API_PROMPT"],
    )
    experience = summarizer.read_logs(self.log_path)
    summaries, cost = summarizer.get_summary_list(experience)

    experience_path = configs["EXPERIENCE_SAVED_PATH"]
    utils.create_folder(experience_path)
    summarizer.create_or_update_yaml(
        summaries, os.path.join(experience_path, "experience.yaml")
    )
    summarizer.create_or_update_vector_db(
        summaries, os.path.join(experience_path, "experience_db")
    )

    self.cost += cost
    utils.print_with_color("The experience has been saved.", "magenta")

`initialize_logger(log_path, log_filename)` `staticmethod`

Initialize logging. log_path: The path of the log file. log_filename: The name of the log file. return: The logger.

Source code in module/basic.py

@staticmethod
def initialize_logger(log_path: str, log_filename: str) -> logging.Logger:
    """
    Initialize logging.
    log_path: The path of the log file.
    log_filename: The name of the log file.
    return: The logger.
    """
    # Code for initializing logging
    logger = logging.Logger(log_filename)

    if not configs["PRINT_LOG"]:
        # Remove existing handlers if PRINT_LOG is False
        logger.handlers = []

    log_file_path = os.path.join(log_path, log_filename)
    file_handler = logging.FileHandler(log_file_path, encoding="utf-8")
    formatter = logging.Formatter("%(message)s")
    file_handler.setFormatter(formatter)
    logger.addHandler(file_handler)
    logger.setLevel(configs["LOG_LEVEL"])

    return logger

`is_error()`

Check if the session is in error state. return: True if the session is in error state, otherwise False.

Source code in module/basic.py

def is_error(self):
    """
    Check if the session is in error state.
    return: True if the session is in error state, otherwise False.
    """
    if self.current_round is not None:
        return self.current_round.state.name() == AgentStatus.ERROR.value
    return False

`is_finished()`

Check if the session is ended. return: True if the session is ended, otherwise False.

Source code in module/basic.py

def is_finished(self) -> bool:
    """
    Check if the session is ended.
    return: True if the session is ended, otherwise False.
    """
    if self._finish or self.step >= configs["MAX_STEP"]:
        return True

    if self.is_error():
        return True

    return False

`next_request()` `abstractmethod`

Get the next request of the session. return: The request of the session.

Source code in module/basic.py

@abstractmethod
def next_request(self) -> str:
    """
    Get the next request of the session.
    return: The request of the session.
    """
    pass

`print_cost()`

Print the total cost of the session.

Source code in module/basic.py

def print_cost(self) -> None:
    """
    Print the total cost of the session.
    """

    if isinstance(self.cost, float) and self.cost > 0:
        formatted_cost = "${:.2f}".format(self.cost)
        utils.print_with_color(
            f"Total request cost of the session: {formatted_cost}$", "yellow"
        )
    else:
        utils.print_with_color(
            "Cost is not available for the model {host_model} or {app_model}.".format(
                host_model=configs["HOST_AGENT"]["API_MODEL"],
                app_model=configs["APP_AGENT"]["API_MODEL"],
            ),
            "yellow",
        )

`request_to_evaluate()` `abstractmethod`

Get the request to evaluate. return: The request(s) to evaluate.

Source code in module/basic.py

@abstractmethod
def request_to_evaluate(self) -> str:
    """
    Get the request to evaluate.
    return: The request(s) to evaluate.
    """
    pass

`run()`

Run the session.

Source code in module/basic.py

def run(self) -> None:
    """
    Run the session.
    """

    while not self.is_finished():

        round = self.create_new_round()
        if round is None:
            break
        round.run()

    if self.application_window is not None:
        self.capture_last_snapshot()

    if self._should_evaluate and not self.is_error():
        self.evaluation()

    self.print_cost()

Session

Session Lifecycle

1. Session Initialization

2. Session Processing

3. Next Round

4. Session Termination

Reference

application_window: UIAWrapper property writable

context: Context property

cost: float property writable

current_round: BaseRound property

evaluation_logger: logging.Logger property

id: int property

rounds: Dict[int, BaseRound] property

session_type: str property

step: int property

total_rounds: int property

add_round(id, round)

capture_last_snapshot()

create_following_round()

create_new_round() abstractmethod

evaluation()

experience_saver()

initialize_logger(log_path, log_filename) staticmethod

is_error()

is_finished()

next_request() abstractmethod

print_cost()

request_to_evaluate() abstractmethod

run()

`application_window: UIAWrapper` `property` `writable`

`context: Context` `property`

`cost: float` `property` `writable`

`current_round: BaseRound` `property`

`evaluation_logger: logging.Logger` `property`

`id: int` `property`

`rounds: Dict[int, BaseRound]` `property`

`session_type: str` `property`

`step: int` `property`

`total_rounds: int` `property`

`add_round(id, round)`

`capture_last_snapshot()`

`create_following_round()`

`create_new_round()` `abstractmethod`

`evaluation()`

`experience_saver()`

`initialize_logger(log_path, log_filename)` `staticmethod`

`is_error()`

`is_finished()`

`next_request()` `abstractmethod`

`print_cost()`

`request_to_evaluate()` `abstractmethod`

`run()`