Round

A Round is a single interaction between the user and UFO that processes a single user request. A Round is responsible for orchestrating the HostAgent and AppAgent to fulfill the user's request.

Round Lifecycle

In a Round, the following steps are executed:

1. Round Initialization

At the beginning of a Round, the Round object is created, and the user's request is processed by the HostAgent to determine the appropriate application to fulfill the request.

2. Action Execution

Once created, the Round orchestrates the HostAgent and AppAgent to execute the necessary actions to fulfill the user's request. The core logic of a Round is shown below:

def run(self) -> None:
    """
    Run the round.
    """

    while not self.is_finished():

        self.agent.handle(self.context)

        self.state = self.agent.state.next_state(self.agent)
        self.agent = self.agent.state.next_agent(self.agent)
        self.agent.set_state(self.state)

        # If the subtask ends, capture the last snapshot of the application.
        if self.state.is_subtask_end():
            time.sleep(configs["SLEEP_TIME"])
            self.capture_last_snapshot(sub_round_id=self.subtask_amount)
            self.subtask_amount += 1

    self.agent.blackboard.add_requests(
        {"request_{i}".format(i=self.id), self.request}
    )

    if self.application_window is not None:
        self.capture_last_snapshot()

    if self._should_evaluate:
        self.evaluation()

At each step, the Round processes the user's request by invoking the handle method of the AppAgent or HostAgent based on the current state. The state determines the next agent to handle the request and the next state to transition to.

3. Request Completion

The AppAgent completes the actions within the application. If the request spans multiple applications, the HostAgent may switch to a different application to continue the task.

4. Round Termination

Once the user's request is fulfilled, the Round is terminated, and the results are returned to the user. If configured, the EvaluationAgent evaluates the completeness of the Round.

Reference

Bases: ABC

A round of a session in UFO. A round manages a single user request and consists of multiple steps. A session may consists of multiple rounds of interactions.

Initialize a round.

Parameters:	`request` (`str`) – The request of the round. `agent` (`BasicAgent`) – The initial agent of the round. `context` (`Context`) – The shared context of the round. `should_evaluate` (`bool`) – Whether to evaluate the round. `id` (`int`) – The id of the round.

Source code in module/basic.py

def __init__(
    self,
    request: str,
    agent: BasicAgent,
    context: Context,
    should_evaluate: bool,
    id: int,
) -> None:
    """
    Initialize a round.
    :param request: The request of the round.
    :param agent: The initial agent of the round.
    :param context: The shared context of the round.
    :param should_evaluate: Whether to evaluate the round.
    :param id: The id of the round.
    """

    self._request = request
    self._context = context
    self._agent = agent
    self._state = agent.state
    self._id = id
    self._should_evaluate = should_evaluate

    self._init_context()

`agent` `property` `writable`

Get the agent of the round. return: The agent of the round.

`application_window` `property` `writable`

Get the application of the session. return: The application of the session.

`context` `property`

Get the context of the round. return: The context of the round.

`cost` `property`

Get the cost of the round. return: The cost of the round.

`id` `property`

Get the id of the round. return: The id of the round.

`log_path` `property`

Get the log path of the round.

return: The log path of the round.

`request` `property`

Get the request of the round. return: The request of the round.

`state` `property` `writable`

Get the status of the round. return: The status of the round.

`step` `property`

Get the local step of the round. return: The step of the round.

`subtask_amount` `property` `writable`

Get the subtask amount of the round. return: The subtask amount of the round.

`_init_context()`

Update the context of the round.

Source code in module/basic.py

def _init_context(self) -> None:
    """
    Update the context of the round.
    """

    # Initialize the round step
    round_step = {self.id: 0}
    self.context.update_dict(ContextNames.ROUND_STEP, round_step)

    # Initialize the round cost
    round_cost = {self.id: 0}
    self.context.update_dict(ContextNames.ROUND_COST, round_cost)

    # Initialize the round subtask amount
    round_subtask_amount = {self.id: 0}
    self.context.update_dict(
        ContextNames.ROUND_SUBTASK_AMOUNT, round_subtask_amount
    )

    # Initialize the round request and the current round id
    self.context.set(ContextNames.REQUEST, self.request)

    self.context.set(ContextNames.CURRENT_ROUND_ID, self.id)

`capture_last_snapshot(sub_round_id=None)`

Capture the last snapshot of the application, including the screenshot and the XML file if configured.

Parameters:	`sub_round_id` (`Optional[int]`, default: `None` ) – The id of the sub-round, default is None.

Source code in module/basic.py

def capture_last_snapshot(self, sub_round_id: Optional[int] = None) -> None:
    """
    Capture the last snapshot of the application, including the screenshot and the XML file if configured.
    :param sub_round_id: The id of the sub-round, default is None.
    """

    # Capture the final screenshot
    if sub_round_id is None:
        screenshot_save_path = self.log_path + f"action_round_{self.id}_final.png"
    else:
        screenshot_save_path = (
            self.log_path
            + f"action_round_{self.id}_sub_round_{sub_round_id}_final.png"
        )

    if self.application_window is not None:

        try:
            PhotographerFacade().capture_app_window_screenshot(
                self.application_window, save_path=screenshot_save_path
            )

        except Exception as e:
            utils.print_with_color(
                f"Warning: The last snapshot capture failed, due to the error: {e}",
                "yellow",
            )

        if configs.get("SAVE_UI_TREE", False):
            step_ui_tree = ui_tree.UITree(self.application_window)

            ui_tree_path = os.path.join(self.log_path, "ui_trees")

            ui_tree_file_name = (
                f"ui_tree_round_{self.id}_final.json"
                if sub_round_id is None
                else f"ui_tree_round_{self.id}_sub_round_{sub_round_id}_final.json"
            )

            step_ui_tree.save_ui_tree_to_json(
                os.path.join(
                    ui_tree_path,
                    ui_tree_file_name,
                )
            )

        if configs.get("SAVE_FULL_SCREEN", False):

            desktop_save_path = (
                self.log_path
                + f"desktop_round_{self.id}_sub_round_{sub_round_id}_final.png"
            )

            # Capture the desktop screenshot for all screens.
            PhotographerFacade().capture_desktop_screen_screenshot(
                all_screens=True, save_path=desktop_save_path
            )

        # Save the final XML file
        if configs["LOG_XML"]:
            log_abs_path = os.path.abspath(self.log_path)
            xml_save_path = os.path.join(
                log_abs_path,
                (
                    f"xml/action_round_{self.id}_final.xml"
                    if sub_round_id is None
                    else f"xml/action_round_{self.id}_sub_round_{sub_round_id}_final.xml"
                ),
            )

            if issubclass(type(self.agent), HostAgent):

                app_agent: AppAgent = self.agent.get_active_appagent()
                app_agent.Puppeteer.save_to_xml(xml_save_path)
            elif issubclass(type(self.agent), AppAgent):
                app_agent: AppAgent = self.agent
                app_agent.Puppeteer.save_to_xml(xml_save_path)

`evaluation()`

TODO: Evaluate the round.

Source code in module/basic.py

def evaluation(self) -> None:
    """
    TODO: Evaluate the round.
    """
    pass

`is_finished()`

Check if the round is finished. return: True if the round is finished, otherwise False.

Source code in module/basic.py

def is_finished(self) -> bool:
    """
    Check if the round is finished.
    return: True if the round is finished, otherwise False.
    """
    return (
        self.state.is_round_end()
        or self.context.get(ContextNames.SESSION_STEP) >= configs["MAX_STEP"]
    )

`print_cost()`

Print the total cost of the round.

Source code in module/basic.py

def print_cost(self) -> None:
    """
    Print the total cost of the round.
    """

    total_cost = self.cost
    if isinstance(total_cost, float):
        formatted_cost = "${:.2f}".format(total_cost)
        utils.print_with_color(
            f"Request total cost for current round is {formatted_cost}", "yellow"
        )

`run()`

Run the round.

Source code in module/basic.py

def run(self) -> None:
    """
    Run the round.
    """

    while not self.is_finished():

        self.agent.handle(self.context)

        self.state = self.agent.state.next_state(self.agent)
        self.agent = self.agent.state.next_agent(self.agent)
        self.agent.set_state(self.state)

        # If the subtask ends, capture the last snapshot of the application.
        if self.state.is_subtask_end():
            time.sleep(configs["SLEEP_TIME"])
            self.capture_last_snapshot(sub_round_id=self.subtask_amount)
            self.subtask_amount += 1

    self.agent.blackboard.add_requests(
        {"request_{i}".format(i=self.id): self.request}
    )

    if self.application_window is not None:
        self.capture_last_snapshot()

    if self._should_evaluate:
        self.evaluation()

Round

Round Lifecycle

1. Round Initialization

2. Action Execution

3. Request Completion

4. Round Termination

Reference

agent property writable

application_window property writable

context property

cost property

id property

log_path property

request property

state property writable

step property

subtask_amount property writable

_init_context()

capture_last_snapshot(sub_round_id=None)

evaluation()

is_finished()

print_cost()

run()

`agent` `property` `writable`

`application_window` `property` `writable`

`context` `property`

`cost` `property`

`id` `property`

`log_path` `property`

`request` `property`

`state` `property` `writable`

`step` `property`

`subtask_amount` `property` `writable`

`_init_context()`

`capture_last_snapshot(sub_round_id=None)`

`evaluation()`

`is_finished()`

`print_cost()`

`run()`