I’m experimenting with the OpenAI Agents SDK (Python) and ran into unexpected behavior with parallel_tool_calls.
According to the docs, setting parallel_tool_calls=False should ensure that the model makes at most one tool call per turn. However, I’m still seeing multiple tool calls returned in the assistant response.
My Code:
import asyncio

from openai import AsyncOpenAI
from agents import Agent, ModelSettings, OpenAIChatCompletionsModel, Runner

client = AsyncOpenAI(
    api_key=GEMINI_API_KEY,
    base_url=BASE_URL,
)
model = OpenAIChatCompletionsModel(
    openai_client=client,
    model="gemini-2.0-flash",  # using the OpenAI SDK against Gemini's OpenAI-compatible endpoint
)

# Omitted some code for brevity (UserInfo, user_info, and the two tool definitions)

async def main():
    agent = Agent[UserInfo](
        name="Assistant",
        model=model,
        tools=[fetch_user_uid, fetch_weather],  # provided two tools
        model_settings=ModelSettings(parallel_tool_calls=False),  # setting `parallel_tool_calls` to False
    )
    result = await Runner.run(
        starting_agent=agent,
        input="what's user uid and what's the weather in karachi?",  # a question that plausibly needs both tools
        context=user_info,
    )
    print(result.final_output)

if __name__ == "__main__":
    asyncio.run(main())
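For anyone reproducing this: the "LLM resp:" dump below is the kind of output the SDK's built-in debug logging prints. A minimal sketch of enabling it, using the SDK's documented enable_verbose_stdout_logging helper, called before Runner.run():

from agents import enable_verbose_stdout_logging

# Print the SDK's debug logs (outgoing LLM requests and raw responses)
# to stdout; useful for checking whether parallel_tool_calls actually
# made it into the request payload.
enable_verbose_stdout_logging()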
Logs:
...
LLM resp:
{
    "content": null,
    "refusal": null,
    "role": "assistant",
    "annotations": null,
    "audio": null,
    "function_call": null,
    "tool_calls": [  # Model is asking to call both tools even when `parallel_tool_calls` is set to `False`
        {
            "id": "",
            "function": {
                "arguments": "{}",
                "name": "fetch_user_uid"
            },
            "type": "function"
        },
        {
            "id": "",
            "function": {
                "arguments": "{\"city\":\"karachi\"}",
                "name": "fetch_weather"
            },
            "type": "function"
        }
    ]
}
...
Expected results:
The model makes at most one tool call per turn.
Actual results:
The model returns both tool calls even though parallel_tool_calls is set to False.
With parallel_tool_calls the model may request all tools at the same time - to make it faster. Without it, the model may call the next tool only after getting the result from the previous one - which lets you use the result of one tool as input for the next. The parallel_tool_calls parameter goes directly into the chat.completions.create() request, and the OpenAI docs state that with it disabled the model will not return multiple tools to call.
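Since the parameter is forwarded to chat.completions.create(), a direct Chat Completions call can show whether the Gemini OpenAI-compatibility endpoint honors it at all. A minimal sketch, assuming the same GEMINI_API_KEY and BASE_URL as in the question; the tool schemas are hand-written stand-ins for the ones the SDK generates:

import asyncio
from openai import AsyncOpenAI

async def check():
    client = AsyncOpenAI(api_key=GEMINI_API_KEY, base_url=BASE_URL)
    resp = await client.chat.completions.create(
        model="gemini-2.0-flash",
        messages=[{"role": "user",
                   "content": "what's user uid and what's the weather in karachi?"}],
        tools=[
            {"type": "function",
             "function": {"name": "fetch_user_uid",
                          "parameters": {"type": "object", "properties": {}}}},
            {"type": "function",
             "function": {"name": "fetch_weather",
                          "parameters": {"type": "object",
                                         "properties": {"city": {"type": "string"}},
                                         "required": ["city"]}}},
        ],
        parallel_tool_calls=False,  # passed verbatim to the endpoint
    )
    # Two entries here would suggest the endpoint ignores the parameter,
    # rather than the Agents SDK dropping it.
    print(resp.choices[0].message.tool_calls)

asyncio.run(check())

If this still returns two tool calls, the limitation would be on the Gemini compatibility layer's side rather than in the Agents SDK.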