Funcpod
| Tenant | Namespace | Podname | Model |
|---|---|---|---|
| public | Trial | public/Trial/L3.3-70B-Loki-V2.0/259/274 | L3.3-70B-Loki-V2.0 |
State
| State | Time |
|---|---|
| Init | 2026-03-01 17:18:43 |
| PullingImage | 2026-03-01 17:18:43 |
| Creating | 2026-03-01 17:18:44 |
| Restoring | 2026-03-01 17:18:47 |
| Standby | 2026-03-01 17:18:47 |
| Resuming | 2026-03-01 18:31:36 |
| Ready | 2026-03-01 18:31:37 |
Log
| INFO 03-01 18:31:41 [loggers.py:116] Engine 000: Avg prompt throughput: 0.0 tokens/s, Avg generation throughput: 0.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 0.0% INFO 03-01 18:31:41 [logger.py:42] Received request cmpl-e5f28d0d250842fabdbe81f71c9f792f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=800, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO 03-01 18:31:41 [logger.py:42] Received request cmpl-3d62a2753c9d4eb583a521204dbbaa1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.4:123 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:42 [async_llm.py:261] Added request cmpl-e5f28d0d250842fabdbe81f71c9f792f-0. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:42 [async_llm.py:261] Added request cmpl-3d62a2753c9d4eb583a521204dbbaa1d-0. INFO 03-01 18:31:44 [logger.py:42] Received request cmpl-b671876b06ab4c9d93cd83208cbd3988-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:44 [async_llm.py:261] Added request cmpl-b671876b06ab4c9d93cd83208cbd3988-0. INFO 03-01 18:31:45 [logger.py:42] Received request cmpl-f6ca3a3b2dbc4a679e7ba1b3cc62402f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:45 [async_llm.py:261] Added request cmpl-f6ca3a3b2dbc4a679e7ba1b3cc62402f-0. INFO 03-01 18:31:46 [logger.py:42] Received request cmpl-b831b651af73484ca51044d3133b019a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:47 [async_llm.py:261] Added request cmpl-b831b651af73484ca51044d3133b019a-0. INFO 03-01 18:31:48 [logger.py:42] Received request cmpl-24876c5cf0594359bc54ca15279c3562-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:48 [async_llm.py:261] Added request cmpl-24876c5cf0594359bc54ca15279c3562-0. INFO 03-01 18:31:49 [logger.py:42] Received request cmpl-ac3b165dc2b741f4b0927367192adae1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:49 [async_llm.py:261] Added request cmpl-ac3b165dc2b741f4b0927367192adae1-0. INFO 03-01 18:31:50 [logger.py:42] Received request cmpl-6510ea3c49784e4d90f5450871431968-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:50 [async_llm.py:261] Added request cmpl-6510ea3c49784e4d90f5450871431968-0. INFO 03-01 18:31:51 [logger.py:42] Received request cmpl-2fafc52946324380afdab3ef3bb0ffe7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:51 [async_llm.py:261] Added request cmpl-2fafc52946324380afdab3ef3bb0ffe7-0. INFO 03-01 18:31:51 [loggers.py:116] Engine 000: Avg prompt throughput: 24.6 tokens/s, Avg generation throughput: 30.5 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 1.9%, Prefix cache hit rate: 45.9% INFO 03-01 18:31:53 [logger.py:42] Received request cmpl-b925e41694bb41cab19653c1777dfc51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:53 [async_llm.py:261] Added request cmpl-b925e41694bb41cab19653c1777dfc51-0. INFO 03-01 18:31:54 [logger.py:42] Received request cmpl-d1fbbc2b778b468582a15a49373e154f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:54 [async_llm.py:261] Added request cmpl-d1fbbc2b778b468582a15a49373e154f-0. INFO 03-01 18:31:55 [logger.py:42] Received request cmpl-677cb4d7114c400894f98d4fa57c6cb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:55 [async_llm.py:261] Added request cmpl-677cb4d7114c400894f98d4fa57c6cb7-0. INFO 03-01 18:31:56 [logger.py:42] Received request cmpl-9dfcd8632c084eacbc5316e67cb0c463-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:56 [async_llm.py:261] Added request cmpl-9dfcd8632c084eacbc5316e67cb0c463-0. INFO 03-01 18:31:57 [logger.py:42] Received request cmpl-1a3b9612a8d0478192864d99b6bbc328-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:57 [async_llm.py:261] Added request cmpl-1a3b9612a8d0478192864d99b6bbc328-0. INFO 03-01 18:31:58 [logger.py:42] Received request cmpl-ab5f62511ac3483da02efc2683551ba0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:31:58 [async_llm.py:261] Added request cmpl-ab5f62511ac3483da02efc2683551ba0-0. INFO 03-01 18:32:00 [logger.py:42] Received request cmpl-132c2cf0e20e41249f0d6a33a4d8f81d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:00 [async_llm.py:261] Added request cmpl-132c2cf0e20e41249f0d6a33a4d8f81d-0. INFO 03-01 18:32:01 [logger.py:42] Received request cmpl-0c175bdd5f0f40aa8412305c2d243239-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:01 [async_llm.py:261] Added request cmpl-0c175bdd5f0f40aa8412305c2d243239-0. INFO 03-01 18:32:01 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 11.8 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 48.7% INFO 03-01 18:32:02 [logger.py:42] Received request cmpl-ccf4ec23718247c480898fc11632cdbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:02 [async_llm.py:261] Added request cmpl-ccf4ec23718247c480898fc11632cdbf-0. INFO 03-01 18:32:03 [logger.py:42] Received request cmpl-19b8a4c92eef40aeaebff985a21a95d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:03 [async_llm.py:261] Added request cmpl-19b8a4c92eef40aeaebff985a21a95d6-0. INFO 03-01 18:32:04 [logger.py:42] Received request cmpl-0b5c49905d564b4184f1a22c9530792b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=800, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.4:123 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:04 [async_llm.py:261] Added request cmpl-0b5c49905d564b4184f1a22c9530792b-0. INFO 03-01 18:32:04 [logger.py:42] Received request cmpl-ab0c45cad77a415e8693557e4a4587da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:04 [async_llm.py:261] Added request cmpl-ab0c45cad77a415e8693557e4a4587da-0. INFO 03-01 18:32:06 [logger.py:42] Received request cmpl-b180364c70a24a65a10ca27ce19c7940-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:06 [async_llm.py:261] Added request cmpl-b180364c70a24a65a10ca27ce19c7940-0. INFO 03-01 18:32:07 [logger.py:42] Received request cmpl-5156b1bb6b294b02a88bdf1bb19d66b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:07 [async_llm.py:261] Added request cmpl-5156b1bb6b294b02a88bdf1bb19d66b0-0. INFO 03-01 18:32:08 [logger.py:42] Received request cmpl-05cd1dfe7ba94d3194d14aacf665a926-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:08 [async_llm.py:261] Added request cmpl-05cd1dfe7ba94d3194d14aacf665a926-0. INFO 03-01 18:32:09 [logger.py:42] Received request cmpl-b57f8fbf34754ce8a46aed7f17bcda60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:09 [async_llm.py:261] Added request cmpl-b57f8fbf34754ce8a46aed7f17bcda60-0. INFO 03-01 18:32:10 [logger.py:42] Received request cmpl-c8e79dd09f594e79ae7fe333b86e7120-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:10 [async_llm.py:261] Added request cmpl-c8e79dd09f594e79ae7fe333b86e7120-0. INFO 03-01 18:32:11 [loggers.py:116] Engine 000: Avg prompt throughput: 27.8 tokens/s, Avg generation throughput: 29.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 1.8%, Prefix cache hit rate: 49.7% INFO 03-01 18:32:11 [logger.py:42] Received request cmpl-c5fb814b3ef4483aad02e911086c0d32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:11 [async_llm.py:261] Added request cmpl-c5fb814b3ef4483aad02e911086c0d32-0. INFO 03-01 18:32:13 [logger.py:42] Received request cmpl-69c362fa2d6b4d68b4044582ca137b96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:13 [async_llm.py:261] Added request cmpl-69c362fa2d6b4d68b4044582ca137b96-0. INFO 03-01 18:32:14 [logger.py:42] Received request cmpl-b525a35385c94cc58ac660acc3ab6359-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:14 [async_llm.py:261] Added request cmpl-b525a35385c94cc58ac660acc3ab6359-0. INFO 03-01 18:32:15 [logger.py:42] Received request cmpl-207ebb24843b40e397658df51f767e36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:15 [async_llm.py:261] Added request cmpl-207ebb24843b40e397658df51f767e36-0. INFO 03-01 18:32:16 [logger.py:42] Received request cmpl-bfacf66327d344c6912733aa48035a3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:16 [async_llm.py:261] Added request cmpl-bfacf66327d344c6912733aa48035a3f-0. INFO 03-01 18:32:17 [logger.py:42] Received request cmpl-7025522022d5437da5219ffa750f690b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:17 [async_llm.py:261] Added request cmpl-7025522022d5437da5219ffa750f690b-0. INFO 03-01 18:32:19 [logger.py:42] Received request cmpl-d725ce9bbf4c420c864ff16365865d54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:19 [async_llm.py:261] Added request cmpl-d725ce9bbf4c420c864ff16365865d54-0. INFO 03-01 18:32:20 [logger.py:42] Received request cmpl-15cd044741d94fb89470ee5ace6d23f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:20 [async_llm.py:261] Added request cmpl-15cd044741d94fb89470ee5ace6d23f4-0. INFO 03-01 18:32:21 [logger.py:42] Received request cmpl-08d9962fa5664f49a1c3da843d28501c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:21 [async_llm.py:261] Added request cmpl-08d9962fa5664f49a1c3da843d28501c-0. INFO 03-01 18:32:21 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 13.6 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.2% INFO 03-01 18:32:22 [logger.py:42] Received request cmpl-b59691aaaeb345d081994bf94311649a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:22 [async_llm.py:261] Added request cmpl-b59691aaaeb345d081994bf94311649a-0. INFO 03-01 18:32:23 [logger.py:42] Received request cmpl-4d6b10b6ef7240ad89f2551c08fa650d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:23 [async_llm.py:261] Added request cmpl-4d6b10b6ef7240ad89f2551c08fa650d-0. INFO 03-01 18:32:24 [logger.py:42] Received request cmpl-44ed9c8e504a42c8a1df9ece482c6139-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:24 [async_llm.py:261] Added request cmpl-44ed9c8e504a42c8a1df9ece482c6139-0. INFO 03-01 18:32:25 [logger.py:42] Received request cmpl-618f2e37b74d40fe9b48ba5f6f9b685b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:26 [async_llm.py:261] Added request cmpl-618f2e37b74d40fe9b48ba5f6f9b685b-0. INFO 03-01 18:32:27 [logger.py:42] Received request cmpl-4e20b7f4abd74d4abd77cffda6bf9f4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:27 [async_llm.py:261] Added request cmpl-4e20b7f4abd74d4abd77cffda6bf9f4f-0. INFO 03-01 18:32:28 [logger.py:42] Received request cmpl-597b2cb241124a3aafff645a92d79156-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:28 [async_llm.py:261] Added request cmpl-597b2cb241124a3aafff645a92d79156-0. INFO 03-01 18:32:29 [logger.py:42] Received request cmpl-284a1eeecbf64940b528fac29183c8e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:29 [async_llm.py:261] Added request cmpl-284a1eeecbf64940b528fac29183c8e4-0. INFO 03-01 18:32:30 [logger.py:42] Received request cmpl-54ad042aea6542e1bd254c829e5c957e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:30 [async_llm.py:261] Added request cmpl-54ad042aea6542e1bd254c829e5c957e-0. INFO 03-01 18:32:31 [logger.py:42] Received request cmpl-a642986c7464475db298a78cbd3b098a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:31 [async_llm.py:261] Added request cmpl-a642986c7464475db298a78cbd3b098a-0. INFO 03-01 18:32:31 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 50.5% INFO 03-01 18:32:32 [logger.py:42] Received request cmpl-c88d35d8f666409cbd34381003adde30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:32 [async_llm.py:261] Added request cmpl-c88d35d8f666409cbd34381003adde30-0. INFO 03-01 18:32:34 [logger.py:42] Received request cmpl-dd3d2c3ce33841a788d12d5d52dfc4b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:34 [async_llm.py:261] Added request cmpl-dd3d2c3ce33841a788d12d5d52dfc4b7-0. INFO 03-01 18:32:35 [logger.py:42] Received request cmpl-3545bff137dd4cb0a9f5782fb2b02d08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:35 [async_llm.py:261] Added request cmpl-3545bff137dd4cb0a9f5782fb2b02d08-0. INFO 03-01 18:32:36 [logger.py:42] Received request cmpl-dfca8767f57245db9281fd5f87bec2b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:36 [async_llm.py:261] Added request cmpl-dfca8767f57245db9281fd5f87bec2b8-0. INFO 03-01 18:32:37 [logger.py:42] Received request cmpl-f918c6eaf1254c2397c47849a7cec601-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:37 [async_llm.py:261] Added request cmpl-f918c6eaf1254c2397c47849a7cec601-0. INFO 03-01 18:32:38 [logger.py:42] Received request cmpl-3672416ff9254cf49f89f8ea8c518ef0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:38 [async_llm.py:261] Added request cmpl-3672416ff9254cf49f89f8ea8c518ef0-0. INFO 03-01 18:32:39 [logger.py:42] Received request cmpl-b75b735da8a04c36968cd1abe0729609-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:39 [async_llm.py:261] Added request cmpl-b75b735da8a04c36968cd1abe0729609-0. INFO 03-01 18:32:41 [logger.py:42] Received request cmpl-0e36e3a9775f46d1a64ff492b18344ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:41 [async_llm.py:261] Added request cmpl-0e36e3a9775f46d1a64ff492b18344ce-0. INFO 03-01 18:32:41 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.6% INFO 03-01 18:32:42 [logger.py:42] Received request cmpl-b76c0bbab9ce4475ad63149a37689d2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:42 [async_llm.py:261] Added request cmpl-b76c0bbab9ce4475ad63149a37689d2e-0. INFO 03-01 18:32:43 [logger.py:42] Received request cmpl-c6e36647036945cc855c6ff14feaaec3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:43 [async_llm.py:261] Added request cmpl-c6e36647036945cc855c6ff14feaaec3-0. INFO 03-01 18:32:44 [logger.py:42] Received request cmpl-09920df2d85a4e5d8b483e9b3c40b508-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:44 [async_llm.py:261] Added request cmpl-09920df2d85a4e5d8b483e9b3c40b508-0. INFO 03-01 18:32:45 [logger.py:42] Received request cmpl-2c00add53784465d9902dd53755f7d7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:45 [async_llm.py:261] Added request cmpl-2c00add53784465d9902dd53755f7d7c-0. INFO 03-01 18:32:46 [logger.py:42] Received request cmpl-3760e461940c4dd7a36f43348a33d4f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:46 [async_llm.py:261] Added request cmpl-3760e461940c4dd7a36f43348a33d4f2-0. INFO 03-01 18:32:48 [logger.py:42] Received request cmpl-19ef2ae0d4d24e27a864ce940ddb7a31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:48 [async_llm.py:261] Added request cmpl-19ef2ae0d4d24e27a864ce940ddb7a31-0. INFO 03-01 18:32:49 [logger.py:42] Received request cmpl-b01c14dbe23d457dbb2dc9d9500b6711-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:49 [async_llm.py:261] Added request cmpl-b01c14dbe23d457dbb2dc9d9500b6711-0. INFO 03-01 18:32:50 [logger.py:42] Received request cmpl-3f0c3e8b338f4815a1930b23492c2e40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:50 [async_llm.py:261] Added request cmpl-3f0c3e8b338f4815a1930b23492c2e40-0. INFO 03-01 18:32:51 [logger.py:42] Received request cmpl-62aed95caa71484aa8508ba2679b54b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:51 [async_llm.py:261] Added request cmpl-62aed95caa71484aa8508ba2679b54b0-0. INFO 03-01 18:32:51 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.8% INFO 03-01 18:32:52 [logger.py:42] Received request cmpl-6c4bd9cf4dd24c73b282a8b30e083191-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:52 [async_llm.py:261] Added request cmpl-6c4bd9cf4dd24c73b282a8b30e083191-0. INFO 03-01 18:32:53 [logger.py:42] Received request cmpl-4f5682ab757d4c3fa7c06625de48cf28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:53 [async_llm.py:261] Added request cmpl-4f5682ab757d4c3fa7c06625de48cf28-0. INFO 03-01 18:32:54 [logger.py:42] Received request cmpl-edebcd60b29e4213b5028f0bbbc022bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:54 [async_llm.py:261] Added request cmpl-edebcd60b29e4213b5028f0bbbc022bb-0. INFO 03-01 18:32:56 [logger.py:42] Received request cmpl-d13bb04e797244d1807bf841907add89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:56 [async_llm.py:261] Added request cmpl-d13bb04e797244d1807bf841907add89-0. INFO 03-01 18:32:57 [logger.py:42] Received request cmpl-80a624ad84464bc6adac7662c10cf6b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:57 [async_llm.py:261] Added request cmpl-80a624ad84464bc6adac7662c10cf6b7-0. INFO 03-01 18:32:58 [logger.py:42] Received request cmpl-3d4bf2533ab34e919cb7c3fe3bb6710d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:58 [async_llm.py:261] Added request cmpl-3d4bf2533ab34e919cb7c3fe3bb6710d-0. INFO 03-01 18:32:59 [logger.py:42] Received request cmpl-46135a04056145d6b68c892e81a29085-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:32:59 [async_llm.py:261] Added request cmpl-46135a04056145d6b68c892e81a29085-0. INFO 03-01 18:33:00 [logger.py:42] Received request cmpl-8ab67acc99b94372a855c1355ca8999e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:00 [async_llm.py:261] Added request cmpl-8ab67acc99b94372a855c1355ca8999e-0. INFO 03-01 18:33:01 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 50.9% INFO 03-01 18:33:01 [logger.py:42] Received request cmpl-95601026e03646aeb0a1eac0f5af89a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:01 [async_llm.py:261] Added request cmpl-95601026e03646aeb0a1eac0f5af89a2-0. INFO 03-01 18:33:03 [logger.py:42] Received request cmpl-93ec4e0f690b49e585559449e5246eb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:03 [async_llm.py:261] Added request cmpl-93ec4e0f690b49e585559449e5246eb7-0. INFO 03-01 18:33:04 [logger.py:42] Received request cmpl-47f9722ea0a34e398f5430c4e561d518-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:04 [async_llm.py:261] Added request cmpl-47f9722ea0a34e398f5430c4e561d518-0. INFO 03-01 18:33:05 [logger.py:42] Received request cmpl-ab527917347f4349a8ff1d9faa4f0806-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:05 [async_llm.py:261] Added request cmpl-ab527917347f4349a8ff1d9faa4f0806-0. INFO 03-01 18:33:06 [logger.py:42] Received request cmpl-18f7aac423cc42418039158b37872c05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:06 [async_llm.py:261] Added request cmpl-18f7aac423cc42418039158b37872c05-0. INFO 03-01 18:33:07 [logger.py:42] Received request cmpl-1732f1c3d2944e798841894fb42f5449-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:07 [async_llm.py:261] Added request cmpl-1732f1c3d2944e798841894fb42f5449-0. INFO 03-01 18:33:08 [logger.py:42] Received request cmpl-6b875746d86b4b6286b207b047711243-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:08 [async_llm.py:261] Added request cmpl-6b875746d86b4b6286b207b047711243-0. INFO 03-01 18:33:10 [logger.py:42] Received request cmpl-786c07902e3a400082e84de514401d33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:10 [async_llm.py:261] Added request cmpl-786c07902e3a400082e84de514401d33-0. INFO 03-01 18:33:11 [logger.py:42] Received request cmpl-50b2ade7fb084c0e9b02438a33c7e186-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:11 [async_llm.py:261] Added request cmpl-50b2ade7fb084c0e9b02438a33c7e186-0. INFO 03-01 18:33:11 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.0% INFO 03-01 18:33:12 [logger.py:42] Received request cmpl-e5904dcc1cd64ecdb95822ea5404fc9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:12 [async_llm.py:261] Added request cmpl-e5904dcc1cd64ecdb95822ea5404fc9b-0. INFO 03-01 18:33:13 [logger.py:42] Received request cmpl-16a559ebb8ff4bc6964530bed94aab52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:13 [async_llm.py:261] Added request cmpl-16a559ebb8ff4bc6964530bed94aab52-0. INFO 03-01 18:33:14 [logger.py:42] Received request cmpl-e7d021d8a9f644179eaf3a0a50504b97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:14 [async_llm.py:261] Added request cmpl-e7d021d8a9f644179eaf3a0a50504b97-0. INFO 03-01 18:33:16 [logger.py:42] Received request cmpl-768b1a2be65e4b74888e441160c1395f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:16 [async_llm.py:261] Added request cmpl-768b1a2be65e4b74888e441160c1395f-0. INFO 03-01 18:33:17 [logger.py:42] Received request cmpl-f94a0fc840a948d0a7aef3860c709bd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:17 [async_llm.py:261] Added request cmpl-f94a0fc840a948d0a7aef3860c709bd7-0. INFO 03-01 18:33:18 [logger.py:42] Received request cmpl-ec86fb75b51f413d9246a5fa83c5f454-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:18 [async_llm.py:261] Added request cmpl-ec86fb75b51f413d9246a5fa83c5f454-0. INFO 03-01 18:33:19 [logger.py:42] Received request cmpl-5a4e9a859e1e40f8b358ca6597831869-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:19 [async_llm.py:261] Added request cmpl-5a4e9a859e1e40f8b358ca6597831869-0. INFO 03-01 18:33:20 [logger.py:42] Received request cmpl-b393070af15d4ae5bf279baa419cc69c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:20 [async_llm.py:261] Added request cmpl-b393070af15d4ae5bf279baa419cc69c-0. INFO 03-01 18:33:21 [logger.py:42] Received request cmpl-9c54827f726b4cbe9ed578b35b39f1ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:21 [async_llm.py:261] Added request cmpl-9c54827f726b4cbe9ed578b35b39f1ea-0. INFO 03-01 18:33:21 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.0% INFO 03-01 18:33:23 [logger.py:42] Received request cmpl-788043b0f9fb4c15ae6da831fa0e1591-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:23 [async_llm.py:261] Added request cmpl-788043b0f9fb4c15ae6da831fa0e1591-0. INFO 03-01 18:33:24 [logger.py:42] Received request cmpl-8e0014403f714368876aae74abb18eb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:24 [async_llm.py:261] Added request cmpl-8e0014403f714368876aae74abb18eb3-0. INFO 03-01 18:33:25 [logger.py:42] Received request cmpl-69991aa14f354a77b54c08bf7c669282-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:25 [async_llm.py:261] Added request cmpl-69991aa14f354a77b54c08bf7c669282-0. INFO 03-01 18:33:26 [logger.py:42] Received request cmpl-279c44548892415a86dab472f916c562-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:26 [async_llm.py:261] Added request cmpl-279c44548892415a86dab472f916c562-0. INFO 03-01 18:33:27 [logger.py:42] Received request cmpl-1f6d6ac660bc47679d8fe1c5be024bf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:27 [async_llm.py:261] Added request cmpl-1f6d6ac660bc47679d8fe1c5be024bf0-0. INFO 03-01 18:33:28 [logger.py:42] Received request cmpl-bace628df8e84db2ad49a1575a6528fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:28 [async_llm.py:261] Added request cmpl-bace628df8e84db2ad49a1575a6528fc-0. INFO 03-01 18:33:30 [logger.py:42] Received request cmpl-1c3806221cee409bba0f7c1d7aa46935-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:30 [async_llm.py:261] Added request cmpl-1c3806221cee409bba0f7c1d7aa46935-0. INFO 03-01 18:33:31 [logger.py:42] Received request cmpl-1256210641584daab7f667c4560471a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:31 [async_llm.py:261] Added request cmpl-1256210641584daab7f667c4560471a3-0. INFO 03-01 18:33:31 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.1% INFO 03-01 18:33:32 [logger.py:42] Received request cmpl-a23f418fafdc4f269dbda17c149931cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:32 [async_llm.py:261] Added request cmpl-a23f418fafdc4f269dbda17c149931cf-0. INFO 03-01 18:33:33 [logger.py:42] Received request cmpl-cf602c86df034fe4ac332b30c7725f5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:33 [async_llm.py:261] Added request cmpl-cf602c86df034fe4ac332b30c7725f5e-0. INFO 03-01 18:33:34 [logger.py:42] Received request cmpl-294813a7ba42414d92d6ee5e6c2768b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:34 [async_llm.py:261] Added request cmpl-294813a7ba42414d92d6ee5e6c2768b5-0. INFO 03-01 18:33:36 [logger.py:42] Received request cmpl-e1a751a82aa245fab8283d6f757fa509-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:36 [async_llm.py:261] Added request cmpl-e1a751a82aa245fab8283d6f757fa509-0. INFO 03-01 18:33:37 [logger.py:42] Received request cmpl-f1e7553f112f437485853dd168facdb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:37 [async_llm.py:261] Added request cmpl-f1e7553f112f437485853dd168facdb8-0. INFO 03-01 18:33:38 [logger.py:42] Received request cmpl-f02e7b3579eb4924bbc38a0cb08e1a4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:38 [async_llm.py:261] Added request cmpl-f02e7b3579eb4924bbc38a0cb08e1a4b-0. INFO 03-01 18:33:39 [logger.py:42] Received request cmpl-5ea1640dbc6d4619835a88c6df299531-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:39 [async_llm.py:261] Added request cmpl-5ea1640dbc6d4619835a88c6df299531-0. INFO 03-01 18:33:40 [logger.py:42] Received request cmpl-33de646e59974df193b975c98e840cac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:40 [async_llm.py:261] Added request cmpl-33de646e59974df193b975c98e840cac-0. INFO 03-01 18:33:41 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.1% INFO 03-01 18:33:42 [logger.py:42] Received request cmpl-970ed4cef9fb4def81adf31d64d12666-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:42 [async_llm.py:261] Added request cmpl-970ed4cef9fb4def81adf31d64d12666-0. INFO 03-01 18:33:43 [logger.py:42] Received request cmpl-1f6801dfdd3d48c7a7c3bed0201f62b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:43 [async_llm.py:261] Added request cmpl-1f6801dfdd3d48c7a7c3bed0201f62b6-0. INFO 03-01 18:33:44 [logger.py:42] Received request cmpl-e9fa713a23f441c0bdd2aa8c975545bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:44 [async_llm.py:261] Added request cmpl-e9fa713a23f441c0bdd2aa8c975545bd-0. INFO 03-01 18:33:45 [logger.py:42] Received request cmpl-526048f471e94a2f97bd0d0c0daca76c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:45 [async_llm.py:261] Added request cmpl-526048f471e94a2f97bd0d0c0daca76c-0. INFO 03-01 18:33:46 [logger.py:42] Received request cmpl-062e69cb2c964c3a8ccdd2cef894bfc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:46 [async_llm.py:261] Added request cmpl-062e69cb2c964c3a8ccdd2cef894bfc1-0. INFO 03-01 18:33:47 [logger.py:42] Received request cmpl-620abdfa54d6467eb277cf8d272c696d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:47 [async_llm.py:261] Added request cmpl-620abdfa54d6467eb277cf8d272c696d-0. INFO 03-01 18:33:49 [logger.py:42] Received request cmpl-a5cd7ca1368b48109f3a5fd345de50a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:49 [async_llm.py:261] Added request cmpl-a5cd7ca1368b48109f3a5fd345de50a2-0. INFO 03-01 18:33:50 [logger.py:42] Received request cmpl-e61649096f164c13a025b8cba935a26b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:50 [async_llm.py:261] Added request cmpl-e61649096f164c13a025b8cba935a26b-0. INFO 03-01 18:33:51 [logger.py:42] Received request cmpl-0c948bad261e49f3be152c140b6ce253-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:51 [async_llm.py:261] Added request cmpl-0c948bad261e49f3be152c140b6ce253-0. INFO 03-01 18:33:51 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.2% INFO 03-01 18:33:52 [logger.py:42] Received request cmpl-0979b618c8e441af93122eff21dd8c9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:52 [async_llm.py:261] Added request cmpl-0979b618c8e441af93122eff21dd8c9b-0. INFO 03-01 18:33:53 [logger.py:42] Received request cmpl-ef783cfd669c461fbf8c118648d4ac80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:53 [async_llm.py:261] Added request cmpl-ef783cfd669c461fbf8c118648d4ac80-0. INFO 03-01 18:33:54 [logger.py:42] Received request cmpl-8618ee79c0ca4f40842fc429bd6a4356-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:54 [async_llm.py:261] Added request cmpl-8618ee79c0ca4f40842fc429bd6a4356-0. INFO 03-01 18:33:55 [logger.py:42] Received request cmpl-b3a05dea96a2414abbb87db0cdb9eb9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:55 [async_llm.py:261] Added request cmpl-b3a05dea96a2414abbb87db0cdb9eb9a-0. INFO 03-01 18:33:57 [logger.py:42] Received request cmpl-96ded003c4594a2993f83b1581ff94dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:57 [async_llm.py:261] Added request cmpl-96ded003c4594a2993f83b1581ff94dc-0. INFO 03-01 18:33:58 [logger.py:42] Received request cmpl-1489f40fe37f47858a02a611619f7e0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:58 [async_llm.py:261] Added request cmpl-1489f40fe37f47858a02a611619f7e0a-0. INFO 03-01 18:33:59 [logger.py:42] Received request cmpl-4fa50accca8446c78f6bc1573625d904-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:33:59 [async_llm.py:261] Added request cmpl-4fa50accca8446c78f6bc1573625d904-0. INFO 03-01 18:34:00 [logger.py:42] Received request cmpl-c0bb46040469493c9fd3538eae64c145-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:00 [async_llm.py:261] Added request cmpl-c0bb46040469493c9fd3538eae64c145-0. INFO 03-01 18:34:01 [logger.py:42] Received request cmpl-08aa5155327246b1909abce0b96b47c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:01 [async_llm.py:261] Added request cmpl-08aa5155327246b1909abce0b96b47c0-0. INFO 03-01 18:34:01 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.2% INFO 03-01 18:34:02 [logger.py:42] Received request cmpl-68d34945d0e44b42961598fd48764d15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:02 [async_llm.py:261] Added request cmpl-68d34945d0e44b42961598fd48764d15-0. INFO 03-01 18:34:04 [logger.py:42] Received request cmpl-b8dfd5c53e994e25992366f6b4bd754d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:04 [async_llm.py:261] Added request cmpl-b8dfd5c53e994e25992366f6b4bd754d-0. INFO 03-01 18:34:05 [logger.py:42] Received request cmpl-3d56bf74457f4d078c9b82822a196d7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:05 [async_llm.py:261] Added request cmpl-3d56bf74457f4d078c9b82822a196d7e-0. INFO 03-01 18:34:06 [logger.py:42] Received request cmpl-4340d9e19cde46c68151a745792342b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:06 [async_llm.py:261] Added request cmpl-4340d9e19cde46c68151a745792342b3-0. INFO 03-01 18:34:07 [logger.py:42] Received request cmpl-80a617aacb6e48c58bd22a18be2a0cd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:07 [async_llm.py:261] Added request cmpl-80a617aacb6e48c58bd22a18be2a0cd2-0. INFO 03-01 18:34:08 [logger.py:42] Received request cmpl-5bc3df36905941098f3b73e0253d2673-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:08 [async_llm.py:261] Added request cmpl-5bc3df36905941098f3b73e0253d2673-0. INFO 03-01 18:34:09 [logger.py:42] Received request cmpl-f298957f5e024414863e527d6d549f68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:09 [async_llm.py:261] Added request cmpl-f298957f5e024414863e527d6d549f68-0. INFO 03-01 18:34:11 [logger.py:42] Received request cmpl-2d2758c3e5144a88acb56b5097128f07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:11 [async_llm.py:261] Added request cmpl-2d2758c3e5144a88acb56b5097128f07-0. INFO 03-01 18:34:11 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.2% INFO 03-01 18:34:12 [logger.py:42] Received request cmpl-52303f60c4fd4bfcb942a3327cc134d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:12 [async_llm.py:261] Added request cmpl-52303f60c4fd4bfcb942a3327cc134d5-0. INFO 03-01 18:34:13 [logger.py:42] Received request cmpl-e5db13d6a30c462b91e072143d06fb49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:13 [async_llm.py:261] Added request cmpl-e5db13d6a30c462b91e072143d06fb49-0. INFO 03-01 18:34:14 [logger.py:42] Received request cmpl-d079dd169e92453caba3835f14336bec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:14 [async_llm.py:261] Added request cmpl-d079dd169e92453caba3835f14336bec-0. INFO 03-01 18:34:15 [logger.py:42] Received request cmpl-83d2f17427614507b072016906edd3da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:15 [async_llm.py:261] Added request cmpl-83d2f17427614507b072016906edd3da-0. INFO 03-01 18:34:16 [logger.py:42] Received request cmpl-2362d528f84a4bd5b5b306e62d183ab8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:16 [async_llm.py:261] Added request cmpl-2362d528f84a4bd5b5b306e62d183ab8-0. INFO 03-01 18:34:18 [logger.py:42] Received request cmpl-8268a70eae004136a706391289f521be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:18 [async_llm.py:261] Added request cmpl-8268a70eae004136a706391289f521be-0. INFO 03-01 18:34:18 [logger.py:42] Received request cmpl-edc0977b58484986bb505b6a85d5513b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=800, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.4:123 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:18 [async_llm.py:261] Added request cmpl-edc0977b58484986bb505b6a85d5513b-0. INFO 03-01 18:34:19 [logger.py:42] Received request cmpl-1fbb113d7eb441b3af2b3c74cce89714-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:19 [async_llm.py:261] Added request cmpl-1fbb113d7eb441b3af2b3c74cce89714-0. INFO 03-01 18:34:20 [logger.py:42] Received request cmpl-42113177011a4366a665e341d0dc3000-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:20 [async_llm.py:261] Added request cmpl-42113177011a4366a665e341d0dc3000-0. INFO 03-01 18:34:21 [logger.py:42] Received request cmpl-8f39e8d1e28e4de4be9b2bf44850e1b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:21 [async_llm.py:261] Added request cmpl-8f39e8d1e28e4de4be9b2bf44850e1b8-0. INFO 03-01 18:34:21 [loggers.py:116] Engine 000: Avg prompt throughput: 31.0 tokens/s, Avg generation throughput: 16.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 1.0%, Prefix cache hit rate: 51.2% INFO 03-01 18:34:22 [logger.py:42] Received request cmpl-65bd72a8567a4a528cadb17db367885d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:22 [async_llm.py:261] Added request cmpl-65bd72a8567a4a528cadb17db367885d-0. INFO 03-01 18:34:23 [logger.py:42] Received request cmpl-10a47500daa34feea0c306732136d26f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:23 [async_llm.py:261] Added request cmpl-10a47500daa34feea0c306732136d26f-0. INFO 03-01 18:34:25 [logger.py:42] Received request cmpl-93b317a6cb6349afbb40771dd3d5b638-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:25 [async_llm.py:261] Added request cmpl-93b317a6cb6349afbb40771dd3d5b638-0. INFO 03-01 18:34:26 [logger.py:42] Received request cmpl-3e39fbeb9fe14c86859c992b3ff549c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:26 [async_llm.py:261] Added request cmpl-3e39fbeb9fe14c86859c992b3ff549c1-0. INFO 03-01 18:34:27 [logger.py:42] Received request cmpl-0452390df63748e68b63d85327823124-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:27 [async_llm.py:261] Added request cmpl-0452390df63748e68b63d85327823124-0. INFO 03-01 18:34:28 [logger.py:42] Received request cmpl-ade274a03c6a4899a00bc64d2a268743-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:28 [async_llm.py:261] Added request cmpl-ade274a03c6a4899a00bc64d2a268743-0. INFO 03-01 18:34:29 [logger.py:42] Received request cmpl-1399f298563846289ba19168c7a3501f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:29 [async_llm.py:261] Added request cmpl-1399f298563846289ba19168c7a3501f-0. INFO 03-01 18:34:30 [logger.py:42] Received request cmpl-844b0ec8088f4b488ae2ac83fd101b20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:30 [async_llm.py:261] Added request cmpl-844b0ec8088f4b488ae2ac83fd101b20-0. INFO 03-01 18:34:31 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 26.7 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3% INFO 03-01 18:34:32 [logger.py:42] Received request cmpl-130c63f3b0d449aebb7cc45492ad9190-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:32 [async_llm.py:261] Added request cmpl-130c63f3b0d449aebb7cc45492ad9190-0. INFO 03-01 18:34:33 [logger.py:42] Received request cmpl-ec6f4c3cedb04ce689a3e88d99f8ebf1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:33 [async_llm.py:261] Added request cmpl-ec6f4c3cedb04ce689a3e88d99f8ebf1-0. INFO 03-01 18:34:34 [logger.py:42] Received request cmpl-7d8301b753af429d853b29207ff04b4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:34 [async_llm.py:261] Added request cmpl-7d8301b753af429d853b29207ff04b4c-0. INFO 03-01 18:34:35 [logger.py:42] Received request cmpl-2a2108b89f1042dfa4df6a3df988697a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:35 [async_llm.py:261] Added request cmpl-2a2108b89f1042dfa4df6a3df988697a-0. INFO 03-01 18:34:36 [logger.py:42] Received request cmpl-9fab1fb279224f22ab86dad4b4fa294d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:36 [async_llm.py:261] Added request cmpl-9fab1fb279224f22ab86dad4b4fa294d-0. INFO 03-01 18:34:37 [logger.py:42] Received request cmpl-ff0b3e911d96400e90ef0e0632768687-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:38 [async_llm.py:261] Added request cmpl-ff0b3e911d96400e90ef0e0632768687-0. INFO 03-01 18:34:39 [logger.py:42] Received request cmpl-5348f44a1c1a4c9499525e6836ff8472-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:39 [async_llm.py:261] Added request cmpl-5348f44a1c1a4c9499525e6836ff8472-0. INFO 03-01 18:34:40 [logger.py:42] Received request cmpl-ea8141e2c1dc4d5eb4959ff2ea8f8710-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:40 [async_llm.py:261] Added request cmpl-ea8141e2c1dc4d5eb4959ff2ea8f8710-0. INFO 03-01 18:34:41 [logger.py:42] Received request cmpl-307e41fe74f14fc9bf4ab64824ab7750-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:41 [async_llm.py:261] Added request cmpl-307e41fe74f14fc9bf4ab64824ab7750-0. INFO 03-01 18:34:41 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3% INFO 03-01 18:34:42 [logger.py:42] Received request cmpl-e27402b9ad7d4b2d96f833baa3e0288a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:42 [async_llm.py:261] Added request cmpl-e27402b9ad7d4b2d96f833baa3e0288a-0. INFO 03-01 18:34:43 [logger.py:42] Received request cmpl-6eef7723118c41a3897cc00cd8879087-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:43 [async_llm.py:261] Added request cmpl-6eef7723118c41a3897cc00cd8879087-0. INFO 03-01 18:34:44 [logger.py:42] Received request cmpl-d32c843b9dda42db934b89921e11e8a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:44 [async_llm.py:261] Added request cmpl-d32c843b9dda42db934b89921e11e8a7-0. INFO 03-01 18:34:46 [logger.py:42] Received request cmpl-b545945002fd43eab658b61f7b986be1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:46 [async_llm.py:261] Added request cmpl-b545945002fd43eab658b61f7b986be1-0. INFO 03-01 18:34:47 [logger.py:42] Received request cmpl-6cc60f7e28bb476db9ee6e95bae8b5fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:47 [async_llm.py:261] Added request cmpl-6cc60f7e28bb476db9ee6e95bae8b5fe-0. INFO 03-01 18:34:48 [logger.py:42] Received request cmpl-528e90c0d92b4e05bd37bc4c029b6f57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:48 [async_llm.py:261] Added request cmpl-528e90c0d92b4e05bd37bc4c029b6f57-0. INFO 03-01 18:34:49 [logger.py:42] Received request cmpl-d1e2f3c242d442e6bcfbfda647089f4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:49 [async_llm.py:261] Added request cmpl-d1e2f3c242d442e6bcfbfda647089f4e-0. INFO 03-01 18:34:50 [logger.py:42] Received request cmpl-fdf49a2bdbe54962a4b859a6a67938d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:50 [async_llm.py:261] Added request cmpl-fdf49a2bdbe54962a4b859a6a67938d1-0. INFO 03-01 18:34:51 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3% INFO 03-01 18:34:52 [logger.py:42] Received request cmpl-a61b3d85e0af49d69b1ed0d47a3128ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:52 [async_llm.py:261] Added request cmpl-a61b3d85e0af49d69b1ed0d47a3128ff-0. INFO 03-01 18:34:53 [logger.py:42] Received request cmpl-dce1068f333647cc85ded9dd8598793e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:53 [async_llm.py:261] Added request cmpl-dce1068f333647cc85ded9dd8598793e-0. INFO 03-01 18:34:54 [logger.py:42] Received request cmpl-f13a77cc8b3e4114b302a6dc7f8a7b68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:54 [async_llm.py:261] Added request cmpl-f13a77cc8b3e4114b302a6dc7f8a7b68-0. INFO 03-01 18:34:55 [logger.py:42] Received request cmpl-2716d41ce1e94b6f834683144cf7c580-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:55 [async_llm.py:261] Added request cmpl-2716d41ce1e94b6f834683144cf7c580-0. INFO 03-01 18:34:56 [logger.py:42] Received request cmpl-48a216956a8b4226b7e02c176d5b20e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:56 [async_llm.py:261] Added request cmpl-48a216956a8b4226b7e02c176d5b20e3-0. INFO 03-01 18:34:57 [logger.py:42] Received request cmpl-7f7ba1003f2c46829bbca6c98a509d8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:57 [async_llm.py:261] Added request cmpl-7f7ba1003f2c46829bbca6c98a509d8e-0. INFO 03-01 18:34:59 [logger.py:42] Received request cmpl-b21d256b31d3493288ac6c97d85afab6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:34:59 [async_llm.py:261] Added request cmpl-b21d256b31d3493288ac6c97d85afab6-0. INFO 03-01 18:35:00 [logger.py:42] Received request cmpl-0eb05a2bd70a4babb52b8f2af0145c39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:00 [async_llm.py:261] Added request cmpl-0eb05a2bd70a4babb52b8f2af0145c39-0. INFO 03-01 18:35:01 [logger.py:42] Received request cmpl-934f7a8276ee489997fe1280d561ed4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:01 [async_llm.py:261] Added request cmpl-934f7a8276ee489997fe1280d561ed4e-0. INFO 03-01 18:35:01 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3% INFO 03-01 18:35:02 [logger.py:42] Received request cmpl-b6dd9a21ed374d048f096d78ae72b187-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:02 [async_llm.py:261] Added request cmpl-b6dd9a21ed374d048f096d78ae72b187-0. INFO 03-01 18:35:03 [logger.py:42] Received request cmpl-c18822abb7cc49e798367ccecf2cf052-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:03 [async_llm.py:261] Added request cmpl-c18822abb7cc49e798367ccecf2cf052-0. INFO 03-01 18:35:04 [logger.py:42] Received request cmpl-c741fccbad26452d91a68543f246f5c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:04 [async_llm.py:261] Added request cmpl-c741fccbad26452d91a68543f246f5c9-0. INFO 03-01 18:35:06 [logger.py:42] Received request cmpl-f8745c47bb1448fc9d77c835677dc192-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:06 [async_llm.py:261] Added request cmpl-f8745c47bb1448fc9d77c835677dc192-0. INFO 03-01 18:35:07 [logger.py:42] Received request cmpl-8e1e0c69464549cd8de9b1db84cdeaa3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:07 [async_llm.py:261] Added request cmpl-8e1e0c69464549cd8de9b1db84cdeaa3-0. INFO 03-01 18:35:08 [logger.py:42] Received request cmpl-570071ad4e43440bb9891eaeaab09ca5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:08 [async_llm.py:261] Added request cmpl-570071ad4e43440bb9891eaeaab09ca5-0. INFO 03-01 18:35:09 [logger.py:42] Received request cmpl-70a4f062af3f4e7289658865ac34f1d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:09 [async_llm.py:261] Added request cmpl-70a4f062af3f4e7289658865ac34f1d9-0. INFO 03-01 18:35:10 [logger.py:42] Received request cmpl-e8f3979ccea9401c979da0886afeb801-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:10 [async_llm.py:261] Added request cmpl-e8f3979ccea9401c979da0886afeb801-0. INFO 03-01 18:35:11 [logger.py:42] Received request cmpl-3db204a51bec45ef8144f3f35da988e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:11 [async_llm.py:261] Added request cmpl-3db204a51bec45ef8144f3f35da988e9-0. INFO 03-01 18:35:11 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.3% INFO 03-01 18:35:13 [logger.py:42] Received request cmpl-397272966171442f82ba22201be9ccef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:13 [async_llm.py:261] Added request cmpl-397272966171442f82ba22201be9ccef-0. INFO 03-01 18:35:14 [logger.py:42] Received request cmpl-e83bd30ee95845129e07d44cda0dfba4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:14 [async_llm.py:261] Added request cmpl-e83bd30ee95845129e07d44cda0dfba4-0. INFO 03-01 18:35:15 [logger.py:42] Received request cmpl-ddc06f27be7443839bee4e66ac120aa0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:15 [async_llm.py:261] Added request cmpl-ddc06f27be7443839bee4e66ac120aa0-0. INFO 03-01 18:35:16 [logger.py:42] Received request cmpl-87c9dc551b5b4d5f8cffdfd7249e4f1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:16 [async_llm.py:261] Added request cmpl-87c9dc551b5b4d5f8cffdfd7249e4f1c-0. INFO 03-01 18:35:17 [logger.py:42] Received request cmpl-5aa59c8f264245e58923f46f1a5c9e6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:17 [async_llm.py:261] Added request cmpl-5aa59c8f264245e58923f46f1a5c9e6d-0. INFO 03-01 18:35:18 [logger.py:42] Received request cmpl-76c27ec06d8a4f0aa1c1eefa8eb35cb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:18 [async_llm.py:261] Added request cmpl-76c27ec06d8a4f0aa1c1eefa8eb35cb8-0. INFO 03-01 18:35:20 [logger.py:42] Received request cmpl-26c66b59d0604f7ea393366dae385eef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:20 [async_llm.py:261] Added request cmpl-26c66b59d0604f7ea393366dae385eef-0. INFO 03-01 18:35:21 [logger.py:42] Received request cmpl-0c16347b6f6042a0bc5e715f0b2eaa0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:21 [async_llm.py:261] Added request cmpl-0c16347b6f6042a0bc5e715f0b2eaa0e-0. INFO 03-01 18:35:21 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.3% INFO 03-01 18:35:22 [logger.py:42] Received request cmpl-f2cac1232bb149a4a899b465cb7b3d23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:22 [async_llm.py:261] Added request cmpl-f2cac1232bb149a4a899b465cb7b3d23-0. INFO 03-01 18:35:23 [logger.py:42] Received request cmpl-6e5e02330a204abcb50b4a8a81aac7c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:23 [async_llm.py:261] Added request cmpl-6e5e02330a204abcb50b4a8a81aac7c0-0. INFO 03-01 18:35:24 [logger.py:42] Received request cmpl-9b4ef17a159146c08a1884692f69ed5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:24 [async_llm.py:261] Added request cmpl-9b4ef17a159146c08a1884692f69ed5d-0. INFO 03-01 18:35:25 [logger.py:42] Received request cmpl-80296966b8204d46b19223dd98aa1130-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:25 [async_llm.py:261] Added request cmpl-80296966b8204d46b19223dd98aa1130-0. INFO 03-01 18:35:26 [logger.py:42] Received request cmpl-13cb670bacfc429a8a9a1868aa2cbe5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:26 [async_llm.py:261] Added request cmpl-13cb670bacfc429a8a9a1868aa2cbe5c-0. INFO 03-01 18:35:28 [logger.py:42] Received request cmpl-b12b83d8107d4a118fe99a800b8c4331-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:28 [async_llm.py:261] Added request cmpl-b12b83d8107d4a118fe99a800b8c4331-0. INFO 03-01 18:35:29 [logger.py:42] Received request cmpl-bad75a0fa6f149678be5c39573afc731-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:29 [async_llm.py:261] Added request cmpl-bad75a0fa6f149678be5c39573afc731-0. INFO 03-01 18:35:30 [logger.py:42] Received request cmpl-3950c605e19948f68c7537f10861628e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:30 [async_llm.py:261] Added request cmpl-3950c605e19948f68c7537f10861628e-0. INFO 03-01 18:35:31 [logger.py:42] Received request cmpl-759a3d0a4d3d4cb99af9b0146e6eeb9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:31 [async_llm.py:261] Added request cmpl-759a3d0a4d3d4cb99af9b0146e6eeb9b-0. INFO 03-01 18:35:31 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:35:32 [logger.py:42] Received request cmpl-cb72f6c929e4426ab4190a26da3c9a39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:32 [async_llm.py:261] Added request cmpl-cb72f6c929e4426ab4190a26da3c9a39-0. INFO 03-01 18:35:33 [logger.py:42] Received request cmpl-85660d35270047dcb15ccf0eb7770d0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:33 [async_llm.py:261] Added request cmpl-85660d35270047dcb15ccf0eb7770d0b-0. INFO 03-01 18:35:35 [logger.py:42] Received request cmpl-24787ef724ba4851b9b5d2665f68e4e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:35 [async_llm.py:261] Added request cmpl-24787ef724ba4851b9b5d2665f68e4e3-0. INFO 03-01 18:35:36 [logger.py:42] Received request cmpl-770f62a2f38f4827956f8ac5a80fe289-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:36 [async_llm.py:261] Added request cmpl-770f62a2f38f4827956f8ac5a80fe289-0. INFO 03-01 18:35:37 [logger.py:42] Received request cmpl-2df433b128b34bd98f8a8c2a24d23014-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:37 [async_llm.py:261] Added request cmpl-2df433b128b34bd98f8a8c2a24d23014-0. INFO 03-01 18:35:38 [logger.py:42] Received request cmpl-1c32ab2725b1408fbc3d7ebf09fca3bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:38 [async_llm.py:261] Added request cmpl-1c32ab2725b1408fbc3d7ebf09fca3bc-0. INFO 03-01 18:35:39 [logger.py:42] Received request cmpl-b86c45cb17244c2386d23cce5af50cff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:39 [async_llm.py:261] Added request cmpl-b86c45cb17244c2386d23cce5af50cff-0. INFO 03-01 18:35:40 [logger.py:42] Received request cmpl-abbc9292fcb04468a3ddfb95e96281fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:40 [async_llm.py:261] Added request cmpl-abbc9292fcb04468a3ddfb95e96281fe-0. INFO 03-01 18:35:41 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:35:42 [logger.py:42] Received request cmpl-90c275e906e347519ed66346c84924b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:42 [async_llm.py:261] Added request cmpl-90c275e906e347519ed66346c84924b4-0. INFO 03-01 18:35:43 [logger.py:42] Received request cmpl-934484cadbea4635b2b21da78d7fa921-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:43 [async_llm.py:261] Added request cmpl-934484cadbea4635b2b21da78d7fa921-0. INFO 03-01 18:35:44 [logger.py:42] Received request cmpl-b5a892fbc2f54e63b9f1b6515c263eab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:44 [async_llm.py:261] Added request cmpl-b5a892fbc2f54e63b9f1b6515c263eab-0. INFO 03-01 18:35:45 [logger.py:42] Received request cmpl-264466b1d4ff412cb1f52abfce678122-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:45 [async_llm.py:261] Added request cmpl-264466b1d4ff412cb1f52abfce678122-0. INFO 03-01 18:35:46 [logger.py:42] Received request cmpl-1322ebdf30b34c269cbaa896c2933344-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:46 [async_llm.py:261] Added request cmpl-1322ebdf30b34c269cbaa896c2933344-0. INFO 03-01 18:35:47 [logger.py:42] Received request cmpl-bc040583810e4c6c8b92a7c629a9cee3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:47 [async_llm.py:261] Added request cmpl-bc040583810e4c6c8b92a7c629a9cee3-0. INFO 03-01 18:35:49 [logger.py:42] Received request cmpl-bb3c84c7377d4d84b54819bf1d5a5455-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:49 [async_llm.py:261] Added request cmpl-bb3c84c7377d4d84b54819bf1d5a5455-0. INFO 03-01 18:35:50 [logger.py:42] Received request cmpl-7b52af0e9d604a82a790bf8b6a3e24c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:50 [async_llm.py:261] Added request cmpl-7b52af0e9d604a82a790bf8b6a3e24c2-0. INFO 03-01 18:35:51 [logger.py:42] Received request cmpl-3482173e75d8462a825f486aa60f820a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:51 [async_llm.py:261] Added request cmpl-3482173e75d8462a825f486aa60f820a-0. INFO 03-01 18:35:51 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:35:52 [logger.py:42] Received request cmpl-de39d13c905740a0bc8273ba7aa57748-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:52 [async_llm.py:261] Added request cmpl-de39d13c905740a0bc8273ba7aa57748-0. INFO 03-01 18:35:53 [logger.py:42] Received request cmpl-6ddf0677877d49e98f94be5fffec575a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:53 [async_llm.py:261] Added request cmpl-6ddf0677877d49e98f94be5fffec575a-0. INFO 03-01 18:35:54 [logger.py:42] Received request cmpl-1619f22fb15449b39ffd7e83617f89b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:54 [async_llm.py:261] Added request cmpl-1619f22fb15449b39ffd7e83617f89b1-0. INFO 03-01 18:35:55 [logger.py:42] Received request cmpl-df2a572c974f4d09bcb69ad8f65d9a7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:55 [async_llm.py:261] Added request cmpl-df2a572c974f4d09bcb69ad8f65d9a7d-0. INFO 03-01 18:35:57 [logger.py:42] Received request cmpl-fee8e084bde7467886ef0155751d43c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:57 [async_llm.py:261] Added request cmpl-fee8e084bde7467886ef0155751d43c1-0. INFO 03-01 18:35:58 [logger.py:42] Received request cmpl-63a78c1b116c476c86ec9600748aa059-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:58 [async_llm.py:261] Added request cmpl-63a78c1b116c476c86ec9600748aa059-0. INFO 03-01 18:35:59 [logger.py:42] Received request cmpl-77e4f139a6e6476a9e1d6e35a1ede5dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:35:59 [async_llm.py:261] Added request cmpl-77e4f139a6e6476a9e1d6e35a1ede5dc-0. INFO 03-01 18:36:00 [logger.py:42] Received request cmpl-ceb5b9a7baa94518ab2bc56ac3c04e80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:00 [async_llm.py:261] Added request cmpl-ceb5b9a7baa94518ab2bc56ac3c04e80-0. INFO 03-01 18:36:01 [logger.py:42] Received request cmpl-20c3c5eba49b43e09a28381f9e8b4a25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:01 [async_llm.py:261] Added request cmpl-20c3c5eba49b43e09a28381f9e8b4a25-0. INFO 03-01 18:36:01 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.4% INFO 03-01 18:36:03 [logger.py:42] Received request cmpl-30ec2dc95c3d4848b3da21640a3b590a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:03 [async_llm.py:261] Added request cmpl-30ec2dc95c3d4848b3da21640a3b590a-0. INFO 03-01 18:36:04 [logger.py:42] Received request cmpl-c7c3fc1ddd124905b0eb09f1b46119c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:04 [async_llm.py:261] Added request cmpl-c7c3fc1ddd124905b0eb09f1b46119c9-0. INFO 03-01 18:36:05 [logger.py:42] Received request cmpl-3d249922242b4ef383dd75df8cb203a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:05 [async_llm.py:261] Added request cmpl-3d249922242b4ef383dd75df8cb203a2-0. INFO 03-01 18:36:06 [logger.py:42] Received request cmpl-7a067f72c39a429984b11ec6aef1766f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:06 [async_llm.py:261] Added request cmpl-7a067f72c39a429984b11ec6aef1766f-0. INFO 03-01 18:36:08 [logger.py:42] Received request cmpl-199b0e79d7104d11871151cf03eccb98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:08 [async_llm.py:261] Added request cmpl-199b0e79d7104d11871151cf03eccb98-0. INFO 03-01 18:36:09 [logger.py:42] Received request cmpl-c8455a17c32145329d342ecde50729be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:09 [async_llm.py:261] Added request cmpl-c8455a17c32145329d342ecde50729be-0. INFO 03-01 18:36:10 [logger.py:42] Received request cmpl-a0941f4bfdf04b97a60df5db340affa7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:10 [async_llm.py:261] Added request cmpl-a0941f4bfdf04b97a60df5db340affa7-0. INFO 03-01 18:36:11 [logger.py:42] Received request cmpl-92e36159f632499387266fc9d4f56b2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:11 [async_llm.py:261] Added request cmpl-92e36159f632499387266fc9d4f56b2c-0. INFO 03-01 18:36:11 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:36:12 [logger.py:42] Received request cmpl-8a6cd7c693844303abf53cee29204b50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:12 [async_llm.py:261] Added request cmpl-8a6cd7c693844303abf53cee29204b50-0. INFO 03-01 18:36:14 [logger.py:42] Received request cmpl-afe0312d7fdc4e7da06e435234cd48d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:14 [async_llm.py:261] Added request cmpl-afe0312d7fdc4e7da06e435234cd48d1-0. INFO 03-01 18:36:15 [logger.py:42] Received request cmpl-a42cc5fbf47d497883acaae96d72144e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:15 [async_llm.py:261] Added request cmpl-a42cc5fbf47d497883acaae96d72144e-0. INFO 03-01 18:36:16 [logger.py:42] Received request cmpl-9383965e2139442f8da69b79aa1a416f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:16 [async_llm.py:261] Added request cmpl-9383965e2139442f8da69b79aa1a416f-0. INFO 03-01 18:36:17 [logger.py:42] Received request cmpl-dd46b8b4baef4609997d8c71df0ba2c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:17 [async_llm.py:261] Added request cmpl-dd46b8b4baef4609997d8c71df0ba2c2-0. INFO 03-01 18:36:18 [logger.py:42] Received request cmpl-897d31bcf03c45c28db57f6f23bcab32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:18 [async_llm.py:261] Added request cmpl-897d31bcf03c45c28db57f6f23bcab32-0. INFO 03-01 18:36:19 [logger.py:42] Received request cmpl-c41e6fa9ab2e4519b59fecc3d80ac12a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:19 [async_llm.py:261] Added request cmpl-c41e6fa9ab2e4519b59fecc3d80ac12a-0. INFO 03-01 18:36:21 [logger.py:42] Received request cmpl-ce12e104229e4f2fad72966313b4f3e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:21 [async_llm.py:261] Added request cmpl-ce12e104229e4f2fad72966313b4f3e8-0. INFO 03-01 18:36:21 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:36:22 [logger.py:42] Received request cmpl-0682871114cc45b2b3031dc59a96b970-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:22 [async_llm.py:261] Added request cmpl-0682871114cc45b2b3031dc59a96b970-0. INFO 03-01 18:36:23 [logger.py:42] Received request cmpl-b38b538c8c674b39897875d4600c63ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:23 [async_llm.py:261] Added request cmpl-b38b538c8c674b39897875d4600c63ab-0. INFO 03-01 18:36:24 [logger.py:42] Received request cmpl-fd5f870a2a344db9b726c713054d358c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:24 [async_llm.py:261] Added request cmpl-fd5f870a2a344db9b726c713054d358c-0. INFO 03-01 18:36:25 [logger.py:42] Received request cmpl-8eab59097e0443d2b64818640c9cb62b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:25 [async_llm.py:261] Added request cmpl-8eab59097e0443d2b64818640c9cb62b-0. INFO 03-01 18:36:26 [logger.py:42] Received request cmpl-5eaa3214f5e24b0480464a917f08853d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:26 [async_llm.py:261] Added request cmpl-5eaa3214f5e24b0480464a917f08853d-0. INFO 03-01 18:36:28 [logger.py:42] Received request cmpl-039f91a2c05149558230eb3c2d99d591-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:28 [async_llm.py:261] Added request cmpl-039f91a2c05149558230eb3c2d99d591-0. INFO 03-01 18:36:29 [logger.py:42] Received request cmpl-35255850c0534d85b46f1910e250f77a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:29 [async_llm.py:261] Added request cmpl-35255850c0534d85b46f1910e250f77a-0. INFO 03-01 18:36:30 [logger.py:42] Received request cmpl-c88b4d8978b14340a62b94e503bb591d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:30 [async_llm.py:261] Added request cmpl-c88b4d8978b14340a62b94e503bb591d-0. INFO 03-01 18:36:31 [logger.py:42] Received request cmpl-03c2119dc6c1490fa8119c7aee13fff1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:31 [async_llm.py:261] Added request cmpl-03c2119dc6c1490fa8119c7aee13fff1-0. INFO 03-01 18:36:31 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:36:32 [logger.py:42] Received request cmpl-cc4fa692053944adbf16b4b165ffec44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:32 [async_llm.py:261] Added request cmpl-cc4fa692053944adbf16b4b165ffec44-0. INFO 03-01 18:36:33 [logger.py:42] Received request cmpl-4b79b8a5f31545c1b3d5357f83170537-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:33 [async_llm.py:261] Added request cmpl-4b79b8a5f31545c1b3d5357f83170537-0. INFO 03-01 18:36:35 [logger.py:42] Received request cmpl-701affccec1f4a0e823de94ab8fc2451-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:35 [async_llm.py:261] Added request cmpl-701affccec1f4a0e823de94ab8fc2451-0. INFO 03-01 18:36:36 [logger.py:42] Received request cmpl-2614067ed84449c3ad9713744c4196a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:36 [async_llm.py:261] Added request cmpl-2614067ed84449c3ad9713744c4196a5-0. INFO 03-01 18:36:37 [logger.py:42] Received request cmpl-3744599f5c104ee28a36af0962975d57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:37 [async_llm.py:261] Added request cmpl-3744599f5c104ee28a36af0962975d57-0. INFO 03-01 18:36:38 [logger.py:42] Received request cmpl-c0fbb3c5b90a4c888c763b9ea403a29f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:38 [async_llm.py:261] Added request cmpl-c0fbb3c5b90a4c888c763b9ea403a29f-0. INFO 03-01 18:36:39 [logger.py:42] Received request cmpl-bcbe1e3e49d34d698c8e07fb50074d54-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:39 [async_llm.py:261] Added request cmpl-bcbe1e3e49d34d698c8e07fb50074d54-0. INFO 03-01 18:36:40 [logger.py:42] Received request cmpl-fc07c80af8634fddbcf7e87c601a9c2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:40 [async_llm.py:261] Added request cmpl-fc07c80af8634fddbcf7e87c601a9c2c-0. INFO 03-01 18:36:41 [logger.py:42] Received request cmpl-0cab7d28ecb245e293dd75e9cf121ea4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:41 [async_llm.py:261] Added request cmpl-0cab7d28ecb245e293dd75e9cf121ea4-0. INFO 03-01 18:36:41 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:36:43 [logger.py:42] Received request cmpl-e710da7a14c0456c91d990df46183078-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:43 [async_llm.py:261] Added request cmpl-e710da7a14c0456c91d990df46183078-0. INFO 03-01 18:36:44 [logger.py:42] Received request cmpl-c48c8ea36db4434c8514855955d2d117-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:44 [async_llm.py:261] Added request cmpl-c48c8ea36db4434c8514855955d2d117-0. INFO 03-01 18:36:45 [logger.py:42] Received request cmpl-b00d3609cace4aa689e273501cb18d51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:45 [async_llm.py:261] Added request cmpl-b00d3609cace4aa689e273501cb18d51-0. INFO 03-01 18:36:46 [logger.py:42] Received request cmpl-586535e417ad4859bb7f6ac6793d62cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:46 [async_llm.py:261] Added request cmpl-586535e417ad4859bb7f6ac6793d62cf-0. INFO 03-01 18:36:47 [logger.py:42] Received request cmpl-169d9e5be5704f3b91ef036be14d0940-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:47 [async_llm.py:261] Added request cmpl-169d9e5be5704f3b91ef036be14d0940-0. INFO 03-01 18:36:48 [logger.py:42] Received request cmpl-81f49036e4ed42eb9e3e3774a3b8204b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:48 [async_llm.py:261] Added request cmpl-81f49036e4ed42eb9e3e3774a3b8204b-0. INFO 03-01 18:36:50 [logger.py:42] Received request cmpl-0b906e770ea74b56ab577a1b09c7702a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:50 [async_llm.py:261] Added request cmpl-0b906e770ea74b56ab577a1b09c7702a-0. INFO 03-01 18:36:51 [logger.py:42] Received request cmpl-7b5d20cffe034e668169f07c9128a5a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:51 [async_llm.py:261] Added request cmpl-7b5d20cffe034e668169f07c9128a5a7-0. INFO 03-01 18:36:51 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:36:52 [logger.py:42] Received request cmpl-b0aba29e04304cf9b4b94359e59d37cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:52 [async_llm.py:261] Added request cmpl-b0aba29e04304cf9b4b94359e59d37cb-0. INFO 03-01 18:36:53 [logger.py:42] Received request cmpl-ec5a35aa75834dc59b6ca5eddb8c8a46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:53 [async_llm.py:261] Added request cmpl-ec5a35aa75834dc59b6ca5eddb8c8a46-0. INFO 03-01 18:36:54 [logger.py:42] Received request cmpl-3bacc38616254802b6a035603db14e7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:54 [async_llm.py:261] Added request cmpl-3bacc38616254802b6a035603db14e7b-0. INFO 03-01 18:36:55 [logger.py:42] Received request cmpl-f6c2e2eca2ee455bb75b98c920cc605d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:55 [async_llm.py:261] Added request cmpl-f6c2e2eca2ee455bb75b98c920cc605d-0. INFO 03-01 18:36:57 [logger.py:42] Received request cmpl-d0ad633dea734b16bb310793a00f05dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:57 [async_llm.py:261] Added request cmpl-d0ad633dea734b16bb310793a00f05dc-0. INFO 03-01 18:36:58 [logger.py:42] Received request cmpl-d7b45ea27f834945bd14fe6d52a591b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:58 [async_llm.py:261] Added request cmpl-d7b45ea27f834945bd14fe6d52a591b9-0. INFO 03-01 18:36:59 [logger.py:42] Received request cmpl-41a78dc258e642a881b41f637a6736ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:36:59 [async_llm.py:261] Added request cmpl-41a78dc258e642a881b41f637a6736ad-0. INFO 03-01 18:37:00 [logger.py:42] Received request cmpl-04f5b2aefdcd48039b8c50657b6df6dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:00 [async_llm.py:261] Added request cmpl-04f5b2aefdcd48039b8c50657b6df6dd-0. INFO 03-01 18:37:01 [logger.py:42] Received request cmpl-bc1765007e3549c69030ee8051328405-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:01 [async_llm.py:261] Added request cmpl-bc1765007e3549c69030ee8051328405-0. INFO 03-01 18:37:01 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:37:02 [logger.py:42] Received request cmpl-60b28031e18344c79dcac8e31784efb4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:02 [async_llm.py:261] Added request cmpl-60b28031e18344c79dcac8e31784efb4-0. INFO 03-01 18:37:04 [logger.py:42] Received request cmpl-12f828c999ca4b02bda2b961a66075e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:04 [async_llm.py:261] Added request cmpl-12f828c999ca4b02bda2b961a66075e7-0. INFO 03-01 18:37:05 [logger.py:42] Received request cmpl-26ba00a536c54bc19f60408ed1bc9785-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:05 [async_llm.py:261] Added request cmpl-26ba00a536c54bc19f60408ed1bc9785-0. INFO 03-01 18:37:06 [logger.py:42] Received request cmpl-03f6741828bc49bdb805440b2dafd64f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:06 [async_llm.py:261] Added request cmpl-03f6741828bc49bdb805440b2dafd64f-0. INFO 03-01 18:37:07 [logger.py:42] Received request cmpl-a08205b67b8d414c9c58c0422c2d7d27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:07 [async_llm.py:261] Added request cmpl-a08205b67b8d414c9c58c0422c2d7d27-0. INFO 03-01 18:37:08 [logger.py:42] Received request cmpl-cc1e4b28fae947379c2181a3b00cb16f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:08 [async_llm.py:261] Added request cmpl-cc1e4b28fae947379c2181a3b00cb16f-0. INFO 03-01 18:37:09 [logger.py:42] Received request cmpl-00f62d21b2904a00a5518a4cd44fa2d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:09 [async_llm.py:261] Added request cmpl-00f62d21b2904a00a5518a4cd44fa2d5-0. INFO 03-01 18:37:10 [logger.py:42] Received request cmpl-784575182b3c45b09c075426cc71900b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:10 [async_llm.py:261] Added request cmpl-784575182b3c45b09c075426cc71900b-0. INFO 03-01 18:37:11 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:37:12 [logger.py:42] Received request cmpl-ff6fb534f24742c5aa6afa133f4c9eae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:12 [async_llm.py:261] Added request cmpl-ff6fb534f24742c5aa6afa133f4c9eae-0. INFO 03-01 18:37:13 [logger.py:42] Received request cmpl-9c786f620cc145a4865a9092b4407489-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:13 [async_llm.py:261] Added request cmpl-9c786f620cc145a4865a9092b4407489-0. INFO 03-01 18:37:14 [logger.py:42] Received request cmpl-bda082ea3e744b44985cb30fc6b6bd9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:14 [async_llm.py:261] Added request cmpl-bda082ea3e744b44985cb30fc6b6bd9f-0. INFO 03-01 18:37:15 [logger.py:42] Received request cmpl-b8cb716991f64a9d85d5ecefb3b41829-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:15 [async_llm.py:261] Added request cmpl-b8cb716991f64a9d85d5ecefb3b41829-0. INFO 03-01 18:37:16 [logger.py:42] Received request cmpl-7a94b62d0b2946158cd4d87a835f3fb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:16 [async_llm.py:261] Added request cmpl-7a94b62d0b2946158cd4d87a835f3fb8-0. INFO 03-01 18:37:17 [logger.py:42] Received request cmpl-66128823b3924a84b057d4c8f15302e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:17 [async_llm.py:261] Added request cmpl-66128823b3924a84b057d4c8f15302e5-0. INFO 03-01 18:37:19 [logger.py:42] Received request cmpl-01b64f46951b4b91b4c12c979c7027fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:19 [async_llm.py:261] Added request cmpl-01b64f46951b4b91b4c12c979c7027fd-0. INFO 03-01 18:37:20 [logger.py:42] Received request cmpl-1c07ff537f36473b94d98bbb04dacfbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:20 [async_llm.py:261] Added request cmpl-1c07ff537f36473b94d98bbb04dacfbc-0. INFO 03-01 18:37:21 [logger.py:42] Received request cmpl-1280031e79034689a7c1e184756d03d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:21 [async_llm.py:261] Added request cmpl-1280031e79034689a7c1e184756d03d2-0. INFO 03-01 18:37:21 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:37:22 [logger.py:42] Received request cmpl-1b47969c36104b1f84a0354f57e485e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:22 [async_llm.py:261] Added request cmpl-1b47969c36104b1f84a0354f57e485e1-0. INFO 03-01 18:37:23 [logger.py:42] Received request cmpl-0abaedb9625342df9c8deaed83db6b26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:23 [async_llm.py:261] Added request cmpl-0abaedb9625342df9c8deaed83db6b26-0. INFO 03-01 18:37:24 [logger.py:42] Received request cmpl-34f7929e5a1647b6b34c48975316549f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:24 [async_llm.py:261] Added request cmpl-34f7929e5a1647b6b34c48975316549f-0. INFO 03-01 18:37:26 [logger.py:42] Received request cmpl-ae59bb7ca2bf46b88b89109ad867269b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:26 [async_llm.py:261] Added request cmpl-ae59bb7ca2bf46b88b89109ad867269b-0. INFO 03-01 18:37:27 [logger.py:42] Received request cmpl-31d74d9785ad4896b9a8ee18c214ea1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:27 [async_llm.py:261] Added request cmpl-31d74d9785ad4896b9a8ee18c214ea1e-0. INFO 03-01 18:37:28 [logger.py:42] Received request cmpl-1892be3182fa486a9d9ec448b8bfe3d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:28 [async_llm.py:261] Added request cmpl-1892be3182fa486a9d9ec448b8bfe3d7-0. INFO 03-01 18:37:29 [logger.py:42] Received request cmpl-29c45a7e8f7d47529f22abdb1988f63e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:29 [async_llm.py:261] Added request cmpl-29c45a7e8f7d47529f22abdb1988f63e-0. INFO 03-01 18:37:30 [logger.py:42] Received request cmpl-5379df54728242039fa82eb5b82454d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:30 [async_llm.py:261] Added request cmpl-5379df54728242039fa82eb5b82454d1-0. INFO 03-01 18:37:31 [logger.py:42] Received request cmpl-f87b9c74e18e4d26b961336b460ee89e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:31 [async_llm.py:261] Added request cmpl-f87b9c74e18e4d26b961336b460ee89e-0. INFO 03-01 18:37:31 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.4% INFO 03-01 18:37:32 [logger.py:42] Received request cmpl-efd06781fabe457da33296895590939a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:33 [async_llm.py:261] Added request cmpl-efd06781fabe457da33296895590939a-0. INFO 03-01 18:37:34 [logger.py:42] Received request cmpl-c2ae32d574234a07b0116a0e9e3c99a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:34 [async_llm.py:261] Added request cmpl-c2ae32d574234a07b0116a0e9e3c99a7-0. INFO 03-01 18:37:35 [logger.py:42] Received request cmpl-b323944ab98d401aadc644eb7a8b7eff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:35 [async_llm.py:261] Added request cmpl-b323944ab98d401aadc644eb7a8b7eff-0. INFO 03-01 18:37:36 [logger.py:42] Received request cmpl-bb534257e41c4f2a931f67131f7c7081-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:36 [async_llm.py:261] Added request cmpl-bb534257e41c4f2a931f67131f7c7081-0. INFO 03-01 18:37:37 [logger.py:42] Received request cmpl-b8c7bf00ee104037921e2dd98759967c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:37 [async_llm.py:261] Added request cmpl-b8c7bf00ee104037921e2dd98759967c-0. INFO 03-01 18:37:38 [logger.py:42] Received request cmpl-15d58d5326a04d74bf9fd4be9ada585c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:38 [async_llm.py:261] Added request cmpl-15d58d5326a04d74bf9fd4be9ada585c-0. INFO 03-01 18:37:39 [logger.py:42] Received request cmpl-294bf9ab82ef4c8fbbe898dcb4e53d65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:39 [async_llm.py:261] Added request cmpl-294bf9ab82ef4c8fbbe898dcb4e53d65-0. INFO 03-01 18:37:41 [logger.py:42] Received request cmpl-cd9a25234b764337880bcf13e7c7e900-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:41 [async_llm.py:261] Added request cmpl-cd9a25234b764337880bcf13e7c7e900-0. INFO 03-01 18:37:41 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.4% INFO 03-01 18:37:42 [logger.py:42] Received request cmpl-6e7e54d2b89a4fdaa64408f5e1c2b31c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:42 [async_llm.py:261] Added request cmpl-6e7e54d2b89a4fdaa64408f5e1c2b31c-0. INFO 03-01 18:37:43 [logger.py:42] Received request cmpl-d6616a62a52d4465a8e99241753f6674-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:43 [async_llm.py:261] Added request cmpl-d6616a62a52d4465a8e99241753f6674-0. INFO 03-01 18:37:44 [logger.py:42] Received request cmpl-c12e3c67e17c4e26856d7e05b8f5e6ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:44 [async_llm.py:261] Added request cmpl-c12e3c67e17c4e26856d7e05b8f5e6ea-0. INFO 03-01 18:37:45 [logger.py:42] Received request cmpl-7df5195a96fc44a7bec60f5ef762e5db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:45 [async_llm.py:261] Added request cmpl-7df5195a96fc44a7bec60f5ef762e5db-0. INFO 03-01 18:37:47 [logger.py:42] Received request cmpl-0755a87df1634bccab1f49e0ee647e96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:47 [async_llm.py:261] Added request cmpl-0755a87df1634bccab1f49e0ee647e96-0. INFO 03-01 18:37:48 [logger.py:42] Received request cmpl-d6a4419cf50b49c0ac129158224f4e00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:48 [async_llm.py:261] Added request cmpl-d6a4419cf50b49c0ac129158224f4e00-0. INFO 03-01 18:37:49 [logger.py:42] Received request cmpl-510b83265ec54956b68134e38978e837-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:49 [async_llm.py:261] Added request cmpl-510b83265ec54956b68134e38978e837-0. INFO 03-01 18:37:50 [logger.py:42] Received request cmpl-5a740fc1786d43aeb99b2805aceb5a53-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:50 [async_llm.py:261] Added request cmpl-5a740fc1786d43aeb99b2805aceb5a53-0. INFO 03-01 18:37:51 [logger.py:42] Received request cmpl-785643b388d943deba6d3d91f8469319-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:51 [async_llm.py:261] Added request cmpl-785643b388d943deba6d3d91f8469319-0. INFO 03-01 18:37:51 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:37:52 [logger.py:42] Received request cmpl-71b6fc1064f4484285bb6e8e40df4fd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:52 [async_llm.py:261] Added request cmpl-71b6fc1064f4484285bb6e8e40df4fd8-0. INFO 03-01 18:37:54 [logger.py:42] Received request cmpl-bb54983d2c0b4b6fb495bcdd67436a3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:54 [async_llm.py:261] Added request cmpl-bb54983d2c0b4b6fb495bcdd67436a3c-0. INFO 03-01 18:37:55 [logger.py:42] Received request cmpl-ff6def9f55ad433f92108c0e47540867-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:55 [async_llm.py:261] Added request cmpl-ff6def9f55ad433f92108c0e47540867-0. INFO 03-01 18:37:56 [logger.py:42] Received request cmpl-f66794be1d0447b0b2677f19fa7fb8d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:56 [async_llm.py:261] Added request cmpl-f66794be1d0447b0b2677f19fa7fb8d7-0. INFO 03-01 18:37:57 [logger.py:42] Received request cmpl-8b0a2638592a42519c97f40d67e9d36e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:57 [async_llm.py:261] Added request cmpl-8b0a2638592a42519c97f40d67e9d36e-0. INFO 03-01 18:37:58 [logger.py:42] Received request cmpl-20277cc26dea4fd4a4ae0cbf3dc31cbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:58 [async_llm.py:261] Added request cmpl-20277cc26dea4fd4a4ae0cbf3dc31cbe-0. INFO 03-01 18:37:59 [logger.py:42] Received request cmpl-26b6d643ada745fda1381c704d0afd01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:37:59 [async_llm.py:261] Added request cmpl-26b6d643ada745fda1381c704d0afd01-0. INFO 03-01 18:38:01 [logger.py:42] Received request cmpl-a4aafcf0563947519d4ff090eaa08708-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:01 [async_llm.py:261] Added request cmpl-a4aafcf0563947519d4ff090eaa08708-0. INFO 03-01 18:38:01 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:38:02 [logger.py:42] Received request cmpl-6bb8e1fba09f4c7ab2380599adbf01a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:02 [async_llm.py:261] Added request cmpl-6bb8e1fba09f4c7ab2380599adbf01a1-0. INFO 03-01 18:38:03 [logger.py:42] Received request cmpl-230c57392a624992a2ce1b4e703415a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:03 [async_llm.py:261] Added request cmpl-230c57392a624992a2ce1b4e703415a7-0. INFO 03-01 18:38:04 [logger.py:42] Received request cmpl-c749d78e3df44586aa2b3709b39c0278-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:04 [async_llm.py:261] Added request cmpl-c749d78e3df44586aa2b3709b39c0278-0. INFO 03-01 18:38:05 [logger.py:42] Received request cmpl-8e45c9471f5a4b01bbfb67da27b4a409-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:05 [async_llm.py:261] Added request cmpl-8e45c9471f5a4b01bbfb67da27b4a409-0. INFO 03-01 18:38:06 [logger.py:42] Received request cmpl-53bd9ecec03340dd8ce1239071c15df0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:06 [async_llm.py:261] Added request cmpl-53bd9ecec03340dd8ce1239071c15df0-0. INFO 03-01 18:38:08 [logger.py:42] Received request cmpl-f17c5fe217bb40f096fa262e63dc49fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:08 [async_llm.py:261] Added request cmpl-f17c5fe217bb40f096fa262e63dc49fb-0. INFO 03-01 18:38:09 [logger.py:42] Received request cmpl-64f3b4e9ffd94ebb87d67778586a9103-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:09 [async_llm.py:261] Added request cmpl-64f3b4e9ffd94ebb87d67778586a9103-0. INFO 03-01 18:38:10 [logger.py:42] Received request cmpl-3722f325b30a44a89268253b506609d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:10 [async_llm.py:261] Added request cmpl-3722f325b30a44a89268253b506609d2-0. INFO 03-01 18:38:11 [logger.py:42] Received request cmpl-14aa62ed06b947c09b913916f883d572-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:11 [async_llm.py:261] Added request cmpl-14aa62ed06b947c09b913916f883d572-0. INFO 03-01 18:38:11 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:38:12 [logger.py:42] Received request cmpl-f75e913f6e6748f78566b48b5c43f253-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:12 [async_llm.py:261] Added request cmpl-f75e913f6e6748f78566b48b5c43f253-0. INFO 03-01 18:38:13 [logger.py:42] Received request cmpl-bb8d47e9c924490c957e0bf096be1986-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:13 [async_llm.py:261] Added request cmpl-bb8d47e9c924490c957e0bf096be1986-0. INFO 03-01 18:38:15 [logger.py:42] Received request cmpl-6a1357f1292342dca7e5ce1a4252c1e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:15 [async_llm.py:261] Added request cmpl-6a1357f1292342dca7e5ce1a4252c1e3-0. INFO 03-01 18:38:16 [logger.py:42] Received request cmpl-2440b63014244768bdc300464a3a7fb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:16 [async_llm.py:261] Added request cmpl-2440b63014244768bdc300464a3a7fb5-0. INFO 03-01 18:38:17 [logger.py:42] Received request cmpl-c73e6321f70146b5b0aac40c1f180e97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:17 [async_llm.py:261] Added request cmpl-c73e6321f70146b5b0aac40c1f180e97-0. INFO 03-01 18:38:18 [logger.py:42] Received request cmpl-5a02239460ab41f0881e9c1300f34787-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:18 [async_llm.py:261] Added request cmpl-5a02239460ab41f0881e9c1300f34787-0. INFO 03-01 18:38:19 [logger.py:42] Received request cmpl-19903664f9054c5594a5a6a1b1500f10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:19 [async_llm.py:261] Added request cmpl-19903664f9054c5594a5a6a1b1500f10-0. INFO 03-01 18:38:21 [logger.py:42] Received request cmpl-e23b774c1ddd47c38bbbd26e06ef64dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:21 [async_llm.py:261] Added request cmpl-e23b774c1ddd47c38bbbd26e06ef64dd-0. INFO 03-01 18:38:21 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:38:22 [logger.py:42] Received request cmpl-42e545f3fe4a4ae982606e8cbe88255f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:22 [async_llm.py:261] Added request cmpl-42e545f3fe4a4ae982606e8cbe88255f-0. INFO 03-01 18:38:23 [logger.py:42] Received request cmpl-499acb688cf24e84bcd0eb1b094664a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:23 [async_llm.py:261] Added request cmpl-499acb688cf24e84bcd0eb1b094664a4-0. INFO 03-01 18:38:24 [logger.py:42] Received request cmpl-8e693943e2e3472abd48a6e29aa0e21b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:24 [async_llm.py:261] Added request cmpl-8e693943e2e3472abd48a6e29aa0e21b-0. INFO 03-01 18:38:25 [logger.py:42] Received request cmpl-1dc646f8621b49b2bd4d17654bb35268-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:25 [async_llm.py:261] Added request cmpl-1dc646f8621b49b2bd4d17654bb35268-0. INFO 03-01 18:38:26 [logger.py:42] Received request cmpl-310d282fa33543c7ac3425fd46bd704b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:26 [async_llm.py:261] Added request cmpl-310d282fa33543c7ac3425fd46bd704b-0. INFO 03-01 18:38:28 [logger.py:42] Received request cmpl-40d73c650a1e4862887672be1e781255-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:28 [async_llm.py:261] Added request cmpl-40d73c650a1e4862887672be1e781255-0. INFO 03-01 18:38:29 [logger.py:42] Received request cmpl-1626551c1f8346288ab0e8156e4ebbaa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:29 [async_llm.py:261] Added request cmpl-1626551c1f8346288ab0e8156e4ebbaa-0. INFO 03-01 18:38:30 [logger.py:42] Received request cmpl-7bf5eb55c212435b8a09d675c026250c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:30 [async_llm.py:261] Added request cmpl-7bf5eb55c212435b8a09d675c026250c-0. INFO 03-01 18:38:31 [logger.py:42] Received request cmpl-832ce5cc4e6c4e16b36136650bde69f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:31 [async_llm.py:261] Added request cmpl-832ce5cc4e6c4e16b36136650bde69f8-0. INFO 03-01 18:38:31 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:38:32 [logger.py:42] Received request cmpl-45a85d45f03e4878b83dcda19faaafbb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:32 [async_llm.py:261] Added request cmpl-45a85d45f03e4878b83dcda19faaafbb-0. INFO 03-01 18:38:33 [logger.py:42] Received request cmpl-9e7b90aafc8642679d4e954fefe698cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:33 [async_llm.py:261] Added request cmpl-9e7b90aafc8642679d4e954fefe698cb-0. INFO 03-01 18:38:35 [logger.py:42] Received request cmpl-208bd741bee8446f9a6b8c1d5363952e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:35 [async_llm.py:261] Added request cmpl-208bd741bee8446f9a6b8c1d5363952e-0. INFO 03-01 18:38:36 [logger.py:42] Received request cmpl-e457da081a544d4d87db4a1a51aeb2d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:36 [async_llm.py:261] Added request cmpl-e457da081a544d4d87db4a1a51aeb2d6-0. INFO 03-01 18:38:37 [logger.py:42] Received request cmpl-aaf7331ef81746e5b909f948db0a46ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:37 [async_llm.py:261] Added request cmpl-aaf7331ef81746e5b909f948db0a46ca-0. INFO 03-01 18:38:38 [logger.py:42] Received request cmpl-c7cfb26894584249b9fc8eadb83b9537-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:38 [async_llm.py:261] Added request cmpl-c7cfb26894584249b9fc8eadb83b9537-0. INFO 03-01 18:38:39 [logger.py:42] Received request cmpl-b28bcc08a4694b75911171512a076d26-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:39 [async_llm.py:261] Added request cmpl-b28bcc08a4694b75911171512a076d26-0. INFO 03-01 18:38:40 [logger.py:42] Received request cmpl-b904b308e02f4ced92b1130b2aeee9d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:40 [async_llm.py:261] Added request cmpl-b904b308e02f4ced92b1130b2aeee9d9-0. INFO 03-01 18:38:41 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:38:42 [logger.py:42] Received request cmpl-aa410c1f20524d11a1f6e7a27a7466db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:42 [async_llm.py:261] Added request cmpl-aa410c1f20524d11a1f6e7a27a7466db-0. INFO 03-01 18:38:43 [logger.py:42] Received request cmpl-2b19183eb6cb48b299343285d9fb8931-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:43 [async_llm.py:261] Added request cmpl-2b19183eb6cb48b299343285d9fb8931-0. INFO 03-01 18:38:44 [logger.py:42] Received request cmpl-ed8607f5a9b24580b4349ff7b5af57cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:44 [async_llm.py:261] Added request cmpl-ed8607f5a9b24580b4349ff7b5af57cc-0. INFO 03-01 18:38:45 [logger.py:42] Received request cmpl-917234f2d83e4a869f9c067cf5322aaf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:45 [async_llm.py:261] Added request cmpl-917234f2d83e4a869f9c067cf5322aaf-0. INFO 03-01 18:38:46 [logger.py:42] Received request cmpl-54fcc21e25da46379e89628e201e9c5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:46 [async_llm.py:261] Added request cmpl-54fcc21e25da46379e89628e201e9c5f-0. INFO 03-01 18:38:47 [logger.py:42] Received request cmpl-62c264284df345ca93bdb0eb021fc810-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:47 [async_llm.py:261] Added request cmpl-62c264284df345ca93bdb0eb021fc810-0. INFO 03-01 18:38:48 [logger.py:42] Received request cmpl-d06be948f902429eaafe38c5a1c1004a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:48 [async_llm.py:261] Added request cmpl-d06be948f902429eaafe38c5a1c1004a-0. INFO 03-01 18:38:50 [logger.py:42] Received request cmpl-6f8717e4cca94c739cc3ec9b93cf0d41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:50 [async_llm.py:261] Added request cmpl-6f8717e4cca94c739cc3ec9b93cf0d41-0. INFO 03-01 18:38:51 [logger.py:42] Received request cmpl-617b628f9b944ee9907f2e54a8346554-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:51 [async_llm.py:261] Added request cmpl-617b628f9b944ee9907f2e54a8346554-0. INFO 03-01 18:38:51 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:38:52 [logger.py:42] Received request cmpl-cca484940ab94fb5b334bd7f5c39d9a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:52 [async_llm.py:261] Added request cmpl-cca484940ab94fb5b334bd7f5c39d9a1-0. INFO 03-01 18:38:53 [logger.py:42] Received request cmpl-7948a5ccc5954eff908275102be6022c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:53 [async_llm.py:261] Added request cmpl-7948a5ccc5954eff908275102be6022c-0. INFO 03-01 18:38:54 [logger.py:42] Received request cmpl-dd6ca05107b34369be451cc104133534-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:54 [async_llm.py:261] Added request cmpl-dd6ca05107b34369be451cc104133534-0. INFO 03-01 18:38:55 [logger.py:42] Received request cmpl-3f851b497caf41348ebe1338a0ec4f81-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:55 [async_llm.py:261] Added request cmpl-3f851b497caf41348ebe1338a0ec4f81-0. INFO 03-01 18:38:57 [logger.py:42] Received request cmpl-91708970ca404c188b435e5cc9159211-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:57 [async_llm.py:261] Added request cmpl-91708970ca404c188b435e5cc9159211-0. INFO 03-01 18:38:58 [logger.py:42] Received request cmpl-4170eb20818e4235bed669ab6cfb2f68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:58 [async_llm.py:261] Added request cmpl-4170eb20818e4235bed669ab6cfb2f68-0. INFO 03-01 18:38:59 [logger.py:42] Received request cmpl-31c8ee845f794c62b80224140f59a647-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:38:59 [async_llm.py:261] Added request cmpl-31c8ee845f794c62b80224140f59a647-0. INFO 03-01 18:39:00 [logger.py:42] Received request cmpl-b6cbd126055243ab923227454ced9db6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:00 [async_llm.py:261] Added request cmpl-b6cbd126055243ab923227454ced9db6-0. INFO 03-01 18:39:01 [logger.py:42] Received request cmpl-edecbea0c31b4a28b4a5bac88b75ca79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:01 [async_llm.py:261] Added request cmpl-edecbea0c31b4a28b4a5bac88b75ca79-0. INFO 03-01 18:39:01 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:39:02 [logger.py:42] Received request cmpl-2247bb4c476f46d9821291433ece7492-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:02 [async_llm.py:261] Added request cmpl-2247bb4c476f46d9821291433ece7492-0. INFO 03-01 18:39:04 [logger.py:42] Received request cmpl-e90ca776d8ad4d8fb2116ef8409b323f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:04 [async_llm.py:261] Added request cmpl-e90ca776d8ad4d8fb2116ef8409b323f-0. INFO 03-01 18:39:05 [logger.py:42] Received request cmpl-dd2c75ff5c1a44d1a12b9cc554ec3897-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:05 [async_llm.py:261] Added request cmpl-dd2c75ff5c1a44d1a12b9cc554ec3897-0. INFO 03-01 18:39:06 [logger.py:42] Received request cmpl-827de3ff3f0f443db8c9066e0fef281b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:06 [async_llm.py:261] Added request cmpl-827de3ff3f0f443db8c9066e0fef281b-0. INFO 03-01 18:39:07 [logger.py:42] Received request cmpl-1d905e93c1b04d9eab06878e09da2155-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:07 [async_llm.py:261] Added request cmpl-1d905e93c1b04d9eab06878e09da2155-0. INFO 03-01 18:39:08 [logger.py:42] Received request cmpl-5f70f353d630466a8cd5a54f885552a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:08 [async_llm.py:261] Added request cmpl-5f70f353d630466a8cd5a54f885552a3-0. INFO 03-01 18:39:09 [logger.py:42] Received request cmpl-b63e8c7fae9243068366a7ee63be6e98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:09 [async_llm.py:261] Added request cmpl-b63e8c7fae9243068366a7ee63be6e98-0. INFO 03-01 18:39:10 [logger.py:42] Received request cmpl-4924fe1effea48f1b2d2b327fe54595c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:10 [async_llm.py:261] Added request cmpl-4924fe1effea48f1b2d2b327fe54595c-0. INFO 03-01 18:39:11 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:39:12 [logger.py:42] Received request cmpl-52d1337bcc9746439048f95d6aa1ee83-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:12 [async_llm.py:261] Added request cmpl-52d1337bcc9746439048f95d6aa1ee83-0. INFO 03-01 18:39:13 [logger.py:42] Received request cmpl-6f2452606b23410684899b5a695d7142-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:13 [async_llm.py:261] Added request cmpl-6f2452606b23410684899b5a695d7142-0. INFO 03-01 18:39:14 [logger.py:42] Received request cmpl-d33ce77cdac24835bd8a474ac1cad04b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:14 [async_llm.py:261] Added request cmpl-d33ce77cdac24835bd8a474ac1cad04b-0. INFO 03-01 18:39:15 [logger.py:42] Received request cmpl-63674f9c1d9d4a428eff81784f59f4b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:15 [async_llm.py:261] Added request cmpl-63674f9c1d9d4a428eff81784f59f4b4-0. INFO 03-01 18:39:16 [logger.py:42] Received request cmpl-12d4393330eb48cca36db128b407d998-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:16 [async_llm.py:261] Added request cmpl-12d4393330eb48cca36db128b407d998-0. INFO 03-01 18:39:18 [logger.py:42] Received request cmpl-d85fa338b74c42e0834f679596ab3d5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:18 [async_llm.py:261] Added request cmpl-d85fa338b74c42e0834f679596ab3d5c-0. INFO 03-01 18:39:19 [logger.py:42] Received request cmpl-29dd100f66e24b16abb480af94c51e6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:19 [async_llm.py:261] Added request cmpl-29dd100f66e24b16abb480af94c51e6d-0. INFO 03-01 18:39:20 [logger.py:42] Received request cmpl-e984fc56ae884d219fbc054421e3abe8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:20 [async_llm.py:261] Added request cmpl-e984fc56ae884d219fbc054421e3abe8-0. INFO 03-01 18:39:21 [logger.py:42] Received request cmpl-8ddfc74067e144799a9df8ed217fe985-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:21 [async_llm.py:261] Added request cmpl-8ddfc74067e144799a9df8ed217fe985-0. INFO 03-01 18:39:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:39:22 [logger.py:42] Received request cmpl-b8f357f45d5644f59c1b2b9e919769ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:22 [async_llm.py:261] Added request cmpl-b8f357f45d5644f59c1b2b9e919769ce-0. INFO 03-01 18:39:24 [logger.py:42] Received request cmpl-f0f7bf4e942e40039166708fcbf67fd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:24 [async_llm.py:261] Added request cmpl-f0f7bf4e942e40039166708fcbf67fd9-0. INFO 03-01 18:39:25 [logger.py:42] Received request cmpl-fdaedc08d9564490811e0639cd12fa24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:25 [async_llm.py:261] Added request cmpl-fdaedc08d9564490811e0639cd12fa24-0. INFO 03-01 18:39:26 [logger.py:42] Received request cmpl-4747ffb25afb4ffc99c58f3d19fc93a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:26 [async_llm.py:261] Added request cmpl-4747ffb25afb4ffc99c58f3d19fc93a8-0. INFO 03-01 18:39:27 [logger.py:42] Received request cmpl-2947d9c6814345bb9a9cf40d62442f1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:27 [async_llm.py:261] Added request cmpl-2947d9c6814345bb9a9cf40d62442f1e-0. INFO 03-01 18:39:28 [logger.py:42] Received request cmpl-53c1237862ec48a0919da3fe9aec17be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:28 [async_llm.py:261] Added request cmpl-53c1237862ec48a0919da3fe9aec17be-0. INFO 03-01 18:39:29 [logger.py:42] Received request cmpl-3c2c6cd7a9294ec8ab7d0f8b6d8e97ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:29 [async_llm.py:261] Added request cmpl-3c2c6cd7a9294ec8ab7d0f8b6d8e97ff-0. INFO 03-01 18:39:31 [logger.py:42] Received request cmpl-3cf050d526394cd7b344c31ae0916a8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:31 [async_llm.py:261] Added request cmpl-3cf050d526394cd7b344c31ae0916a8a-0. INFO 03-01 18:39:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:39:32 [logger.py:42] Received request cmpl-cd2c77488d094bbda05566e873079383-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:32 [async_llm.py:261] Added request cmpl-cd2c77488d094bbda05566e873079383-0. INFO 03-01 18:39:33 [logger.py:42] Received request cmpl-2c1892ea579a4ebc8bd6f2cebfc76ea2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:33 [async_llm.py:261] Added request cmpl-2c1892ea579a4ebc8bd6f2cebfc76ea2-0. INFO 03-01 18:39:34 [logger.py:42] Received request cmpl-20f82d3993c541a490fe53bb5dd84360-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:34 [async_llm.py:261] Added request cmpl-20f82d3993c541a490fe53bb5dd84360-0. INFO 03-01 18:39:35 [logger.py:42] Received request cmpl-591771245ec941928c504106cfd48d15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:35 [async_llm.py:261] Added request cmpl-591771245ec941928c504106cfd48d15-0. INFO 03-01 18:39:36 [logger.py:42] Received request cmpl-9a615cf8ec3a451d8858bbe187f4d5fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:36 [async_llm.py:261] Added request cmpl-9a615cf8ec3a451d8858bbe187f4d5fb-0. INFO 03-01 18:39:38 [logger.py:42] Received request cmpl-a4f77bf065154f0099bc6217495c8c13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:38 [async_llm.py:261] Added request cmpl-a4f77bf065154f0099bc6217495c8c13-0. INFO 03-01 18:39:39 [logger.py:42] Received request cmpl-5b1c499ade7e42b6a8bc690cac9f2a97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:39 [async_llm.py:261] Added request cmpl-5b1c499ade7e42b6a8bc690cac9f2a97-0. INFO 03-01 18:39:40 [logger.py:42] Received request cmpl-20ef6e1101364aeaa85a5cbdea6ca5b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:40 [async_llm.py:261] Added request cmpl-20ef6e1101364aeaa85a5cbdea6ca5b5-0. INFO 03-01 18:39:41 [logger.py:42] Received request cmpl-b03da08814f24a70bd58287babf16c6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:41 [async_llm.py:261] Added request cmpl-b03da08814f24a70bd58287babf16c6d-0. INFO 03-01 18:39:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:39:42 [logger.py:42] Received request cmpl-1b915e2a09f44f2f816b8294d3269881-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:42 [async_llm.py:261] Added request cmpl-1b915e2a09f44f2f816b8294d3269881-0. INFO 03-01 18:39:43 [logger.py:42] Received request cmpl-39078d26bf2c4001947d5793714deac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:43 [async_llm.py:261] Added request cmpl-39078d26bf2c4001947d5793714deac6-0. INFO 03-01 18:39:44 [logger.py:42] Received request cmpl-9115e7dd8bbc45ca9a60812874ae3369-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:44 [async_llm.py:261] Added request cmpl-9115e7dd8bbc45ca9a60812874ae3369-0. INFO 03-01 18:39:46 [logger.py:42] Received request cmpl-dd7e9397990a4019bd9e6eb397b47816-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:46 [async_llm.py:261] Added request cmpl-dd7e9397990a4019bd9e6eb397b47816-0. INFO 03-01 18:39:47 [logger.py:42] Received request cmpl-04436bc0959d44e491b0079bb8361b50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:47 [async_llm.py:261] Added request cmpl-04436bc0959d44e491b0079bb8361b50-0. INFO 03-01 18:39:48 [logger.py:42] Received request cmpl-3e1d48dec8ff4437880a9878cad15ee0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:48 [async_llm.py:261] Added request cmpl-3e1d48dec8ff4437880a9878cad15ee0-0. INFO 03-01 18:39:49 [logger.py:42] Received request cmpl-e5ba483949fa4d9da2cabb8a1d041a65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:49 [async_llm.py:261] Added request cmpl-e5ba483949fa4d9da2cabb8a1d041a65-0. INFO 03-01 18:39:50 [logger.py:42] Received request cmpl-e54553b9583c4dd38be37ddc45ac011b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:50 [async_llm.py:261] Added request cmpl-e54553b9583c4dd38be37ddc45ac011b-0. INFO 03-01 18:39:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:39:52 [logger.py:42] Received request cmpl-d5898226cae4485c89babc86c74209c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:52 [async_llm.py:261] Added request cmpl-d5898226cae4485c89babc86c74209c5-0. INFO 03-01 18:39:53 [logger.py:42] Received request cmpl-e85a7cf4048d4a74a4101ef830c181b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:53 [async_llm.py:261] Added request cmpl-e85a7cf4048d4a74a4101ef830c181b4-0. INFO 03-01 18:39:54 [logger.py:42] Received request cmpl-ea1e3d675efd4dc6bc91aa7da6e18330-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:54 [async_llm.py:261] Added request cmpl-ea1e3d675efd4dc6bc91aa7da6e18330-0. INFO 03-01 18:39:55 [logger.py:42] Received request cmpl-2bd10ff333a44eebaecb6117b458d327-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:55 [async_llm.py:261] Added request cmpl-2bd10ff333a44eebaecb6117b458d327-0. INFO 03-01 18:39:56 [logger.py:42] Received request cmpl-40a6257e3a3e493f90e25eb9b24fc0dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:56 [async_llm.py:261] Added request cmpl-40a6257e3a3e493f90e25eb9b24fc0dd-0. INFO 03-01 18:39:57 [logger.py:42] Received request cmpl-1c0bc4eec22141a4afdf4e08a3eb0766-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:57 [async_llm.py:261] Added request cmpl-1c0bc4eec22141a4afdf4e08a3eb0766-0. INFO 03-01 18:39:59 [logger.py:42] Received request cmpl-0490759c99454d68b2c612cc7e1c8456-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:39:59 [async_llm.py:261] Added request cmpl-0490759c99454d68b2c612cc7e1c8456-0. INFO 03-01 18:40:00 [logger.py:42] Received request cmpl-778c4c13a2684cc5981d642130741d19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:00 [async_llm.py:261] Added request cmpl-778c4c13a2684cc5981d642130741d19-0. INFO 03-01 18:40:01 [logger.py:42] Received request cmpl-d67825d36641401295ced062175c9218-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:01 [async_llm.py:261] Added request cmpl-d67825d36641401295ced062175c9218-0. INFO 03-01 18:40:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:40:02 [logger.py:42] Received request cmpl-7785ad4375bd4a748c1500131cf04b21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:02 [async_llm.py:261] Added request cmpl-7785ad4375bd4a748c1500131cf04b21-0. INFO 03-01 18:40:03 [logger.py:42] Received request cmpl-93d8c90002d34693ac6a7b0425f9074b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:03 [async_llm.py:261] Added request cmpl-93d8c90002d34693ac6a7b0425f9074b-0. INFO 03-01 18:40:05 [logger.py:42] Received request cmpl-67c5bb0c76b147d8b81fcdd4cbfd4993-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:05 [async_llm.py:261] Added request cmpl-67c5bb0c76b147d8b81fcdd4cbfd4993-0. INFO 03-01 18:40:06 [logger.py:42] Received request cmpl-0f9badb151494f0181da6155df357f62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:06 [async_llm.py:261] Added request cmpl-0f9badb151494f0181da6155df357f62-0. INFO 03-01 18:40:07 [logger.py:42] Received request cmpl-50c39bc355db4262935cfacbf6ab6ed1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:07 [async_llm.py:261] Added request cmpl-50c39bc355db4262935cfacbf6ab6ed1-0. INFO 03-01 18:40:08 [logger.py:42] Received request cmpl-408b1bc01da44b119727c83b49f6d579-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:08 [async_llm.py:261] Added request cmpl-408b1bc01da44b119727c83b49f6d579-0. INFO 03-01 18:40:09 [logger.py:42] Received request cmpl-25ca19fe596e47d18fdd50c09522654c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:09 [async_llm.py:261] Added request cmpl-25ca19fe596e47d18fdd50c09522654c-0. INFO 03-01 18:40:10 [logger.py:42] Received request cmpl-1b471fa9438e45e2ad0fe807484190aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:10 [async_llm.py:261] Added request cmpl-1b471fa9438e45e2ad0fe807484190aa-0. INFO 03-01 18:40:12 [logger.py:42] Received request cmpl-bb2c9c81998246d4b5f8911bdd190033-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:12 [async_llm.py:261] Added request cmpl-bb2c9c81998246d4b5f8911bdd190033-0. INFO 03-01 18:40:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:40:13 [logger.py:42] Received request cmpl-6d4bb35956ef4204b800e41154845ce3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:13 [async_llm.py:261] Added request cmpl-6d4bb35956ef4204b800e41154845ce3-0. INFO 03-01 18:40:14 [logger.py:42] Received request cmpl-76abb3c01bce4ef69e6930af0ef2b0ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:14 [async_llm.py:261] Added request cmpl-76abb3c01bce4ef69e6930af0ef2b0ed-0. INFO 03-01 18:40:15 [logger.py:42] Received request cmpl-1b20a52823784893acdb9eb854329bf9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:15 [async_llm.py:261] Added request cmpl-1b20a52823784893acdb9eb854329bf9-0. INFO 03-01 18:40:16 [logger.py:42] Received request cmpl-f54a7671babd441fab1a433f2191c6c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:16 [async_llm.py:261] Added request cmpl-f54a7671babd441fab1a433f2191c6c7-0. INFO 03-01 18:40:17 [logger.py:42] Received request cmpl-6eb68ea66b8c4b4b9a59f8593f776fd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:17 [async_llm.py:261] Added request cmpl-6eb68ea66b8c4b4b9a59f8593f776fd6-0. INFO 03-01 18:40:18 [logger.py:42] Received request cmpl-0cecabc31d37427cade1f2f83eca9451-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:18 [async_llm.py:261] Added request cmpl-0cecabc31d37427cade1f2f83eca9451-0. INFO 03-01 18:40:20 [logger.py:42] Received request cmpl-a493b9f2378145c9bcee6a8fed429856-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:20 [async_llm.py:261] Added request cmpl-a493b9f2378145c9bcee6a8fed429856-0. INFO 03-01 18:40:21 [logger.py:42] Received request cmpl-af055a71e4e040e4901b15c7160dc030-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:21 [async_llm.py:261] Added request cmpl-af055a71e4e040e4901b15c7160dc030-0. INFO 03-01 18:40:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:40:22 [logger.py:42] Received request cmpl-48aecb82358646e5bcbdbfac54c474d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:22 [async_llm.py:261] Added request cmpl-48aecb82358646e5bcbdbfac54c474d4-0. INFO 03-01 18:40:23 [logger.py:42] Received request cmpl-061a194a37c649bcbe839aa0722fbd93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:23 [async_llm.py:261] Added request cmpl-061a194a37c649bcbe839aa0722fbd93-0. INFO 03-01 18:40:24 [logger.py:42] Received request cmpl-0a93a8c18a1c4e0eb320f7ef44a97c0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:24 [async_llm.py:261] Added request cmpl-0a93a8c18a1c4e0eb320f7ef44a97c0c-0. INFO 03-01 18:40:25 [logger.py:42] Received request cmpl-fca2cf6c072c41f58b264e52f8add1cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:25 [async_llm.py:261] Added request cmpl-fca2cf6c072c41f58b264e52f8add1cb-0. INFO 03-01 18:40:27 [logger.py:42] Received request cmpl-166f75106604474e8327a8abd99ffeb1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:27 [async_llm.py:261] Added request cmpl-166f75106604474e8327a8abd99ffeb1-0. INFO 03-01 18:40:28 [logger.py:42] Received request cmpl-1a7bd584d08549dc8ad7e5a14820cc9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:28 [async_llm.py:261] Added request cmpl-1a7bd584d08549dc8ad7e5a14820cc9f-0. INFO 03-01 18:40:29 [logger.py:42] Received request cmpl-12d3acb44201448cae3dbd73a3232061-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:29 [async_llm.py:261] Added request cmpl-12d3acb44201448cae3dbd73a3232061-0. INFO 03-01 18:40:30 [logger.py:42] Received request cmpl-00bdc0ab21e041c9875c7e2ea26db3df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:30 [async_llm.py:261] Added request cmpl-00bdc0ab21e041c9875c7e2ea26db3df-0. INFO 03-01 18:40:31 [logger.py:42] Received request cmpl-ae1e4a3fd800427a991251eec3b6a14c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:31 [async_llm.py:261] Added request cmpl-ae1e4a3fd800427a991251eec3b6a14c-0. INFO 03-01 18:40:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:40:32 [logger.py:42] Received request cmpl-36aa8e840f254420b24e6c6fc4176436-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:32 [async_llm.py:261] Added request cmpl-36aa8e840f254420b24e6c6fc4176436-0. INFO 03-01 18:40:34 [logger.py:42] Received request cmpl-426fd9dd115247b3849bb6f736a40818-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:34 [async_llm.py:261] Added request cmpl-426fd9dd115247b3849bb6f736a40818-0. INFO 03-01 18:40:35 [logger.py:42] Received request cmpl-c8691193089e4cae9e829124f408b18b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:35 [async_llm.py:261] Added request cmpl-c8691193089e4cae9e829124f408b18b-0. INFO 03-01 18:40:36 [logger.py:42] Received request cmpl-44e729a2cc9044778eb021c813ec88a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:36 [async_llm.py:261] Added request cmpl-44e729a2cc9044778eb021c813ec88a1-0. INFO 03-01 18:40:37 [logger.py:42] Received request cmpl-54e4ef857475416fa79ec95f108155ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:37 [async_llm.py:261] Added request cmpl-54e4ef857475416fa79ec95f108155ee-0. INFO 03-01 18:40:38 [logger.py:42] Received request cmpl-97a7eff827db4990825fbde1935255b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:38 [async_llm.py:261] Added request cmpl-97a7eff827db4990825fbde1935255b7-0. INFO 03-01 18:40:39 [logger.py:42] Received request cmpl-b91d1a18fd804a59a5e392e0a5a36ff2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:39 [async_llm.py:261] Added request cmpl-b91d1a18fd804a59a5e392e0a5a36ff2-0. INFO 03-01 18:40:40 [logger.py:42] Received request cmpl-94917cb004e64162ae51ef1ad5f30e82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:40 [async_llm.py:261] Added request cmpl-94917cb004e64162ae51ef1ad5f30e82-0. INFO 03-01 18:40:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:40:42 [logger.py:42] Received request cmpl-a9f0a59572e04114a0b4bcf958c2f4cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:42 [async_llm.py:261] Added request cmpl-a9f0a59572e04114a0b4bcf958c2f4cc-0. INFO 03-01 18:40:43 [logger.py:42] Received request cmpl-0f6fdeea44144f02a1044829f2bb1a99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:43 [async_llm.py:261] Added request cmpl-0f6fdeea44144f02a1044829f2bb1a99-0. INFO 03-01 18:40:44 [logger.py:42] Received request cmpl-b4ff603b7fe54230ae8ae1fa0ee9b0de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:44 [async_llm.py:261] Added request cmpl-b4ff603b7fe54230ae8ae1fa0ee9b0de-0. INFO 03-01 18:40:45 [logger.py:42] Received request cmpl-ad3afb911dcc48ee83226f9477e8a0ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:45 [async_llm.py:261] Added request cmpl-ad3afb911dcc48ee83226f9477e8a0ec-0. INFO 03-01 18:40:46 [logger.py:42] Received request cmpl-d488bf5b833f4dd38115416a039c2a88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:46 [async_llm.py:261] Added request cmpl-d488bf5b833f4dd38115416a039c2a88-0. INFO 03-01 18:40:47 [logger.py:42] Received request cmpl-acfb131e55f443228fe625b002099dd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:47 [async_llm.py:261] Added request cmpl-acfb131e55f443228fe625b002099dd8-0. INFO 03-01 18:40:49 [logger.py:42] Received request cmpl-5116a54c2e2542f7b0ce181b4adcd7b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:49 [async_llm.py:261] Added request cmpl-5116a54c2e2542f7b0ce181b4adcd7b2-0. INFO 03-01 18:40:50 [logger.py:42] Received request cmpl-d8872e79dd37495aad337677d9b0896a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:50 [async_llm.py:261] Added request cmpl-d8872e79dd37495aad337677d9b0896a-0. INFO 03-01 18:40:51 [logger.py:42] Received request cmpl-70486c4fb61a4e24b020b9311a4558b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:51 [async_llm.py:261] Added request cmpl-70486c4fb61a4e24b020b9311a4558b7-0. INFO 03-01 18:40:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:40:52 [logger.py:42] Received request cmpl-29690cfd114549b3b00373b8311893fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:52 [async_llm.py:261] Added request cmpl-29690cfd114549b3b00373b8311893fe-0. INFO 03-01 18:40:53 [logger.py:42] Received request cmpl-c4218c00ab4346d5928f2c62dab7b1c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:53 [async_llm.py:261] Added request cmpl-c4218c00ab4346d5928f2c62dab7b1c5-0. INFO 03-01 18:40:54 [logger.py:42] Received request cmpl-10ec5398b4fc48e4803997059ad72933-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:54 [async_llm.py:261] Added request cmpl-10ec5398b4fc48e4803997059ad72933-0. INFO 03-01 18:40:56 [logger.py:42] Received request cmpl-b64ca07c42c443efb308fe9fede750fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:56 [async_llm.py:261] Added request cmpl-b64ca07c42c443efb308fe9fede750fc-0. INFO 03-01 18:40:57 [logger.py:42] Received request cmpl-e2dcc1974679486492f83345bf3affc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:57 [async_llm.py:261] Added request cmpl-e2dcc1974679486492f83345bf3affc5-0. INFO 03-01 18:40:58 [logger.py:42] Received request cmpl-e7e160542f4b462fab6fd287c0c6e019-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:58 [async_llm.py:261] Added request cmpl-e7e160542f4b462fab6fd287c0c6e019-0. INFO 03-01 18:40:59 [logger.py:42] Received request cmpl-704070c0ef194a3f90fb26f9b399f39f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:40:59 [async_llm.py:261] Added request cmpl-704070c0ef194a3f90fb26f9b399f39f-0. INFO 03-01 18:41:00 [logger.py:42] Received request cmpl-42481f7357ab45f3bfc4ed1950076a06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:00 [async_llm.py:261] Added request cmpl-42481f7357ab45f3bfc4ed1950076a06-0. INFO 03-01 18:41:01 [logger.py:42] Received request cmpl-f74462615f294a6c930be5d74845630a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:01 [async_llm.py:261] Added request cmpl-f74462615f294a6c930be5d74845630a-0. INFO 03-01 18:41:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:41:02 [logger.py:42] Received request cmpl-c4e67a1b19924738928c5f452e9f1f33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:02 [async_llm.py:261] Added request cmpl-c4e67a1b19924738928c5f452e9f1f33-0. INFO 03-01 18:41:04 [logger.py:42] Received request cmpl-962437f17721413c8a3064900c90506c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:04 [async_llm.py:261] Added request cmpl-962437f17721413c8a3064900c90506c-0. INFO 03-01 18:41:05 [logger.py:42] Received request cmpl-7d887b7cf27745529b9d14df731e30d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:05 [async_llm.py:261] Added request cmpl-7d887b7cf27745529b9d14df731e30d6-0. INFO 03-01 18:41:06 [logger.py:42] Received request cmpl-1235932280f247bab79319fde515f65a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:06 [async_llm.py:261] Added request cmpl-1235932280f247bab79319fde515f65a-0. INFO 03-01 18:41:07 [logger.py:42] Received request cmpl-08e9a2ff8e1942bbb13433ba539f22ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:07 [async_llm.py:261] Added request cmpl-08e9a2ff8e1942bbb13433ba539f22ad-0. INFO 03-01 18:41:08 [logger.py:42] Received request cmpl-ed889f302a9f4ef6a29b0d9064185945-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:08 [async_llm.py:261] Added request cmpl-ed889f302a9f4ef6a29b0d9064185945-0. INFO 03-01 18:41:09 [logger.py:42] Received request cmpl-96f718048d984b389c43e0a6984376fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:09 [async_llm.py:261] Added request cmpl-96f718048d984b389c43e0a6984376fe-0. INFO 03-01 18:41:11 [logger.py:42] Received request cmpl-f4b3b8b774d24965ba7c495742fc9434-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:11 [async_llm.py:261] Added request cmpl-f4b3b8b774d24965ba7c495742fc9434-0. INFO 03-01 18:41:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:41:12 [logger.py:42] Received request cmpl-b7ecae8d21b84336bb23b171334dcde6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:12 [async_llm.py:261] Added request cmpl-b7ecae8d21b84336bb23b171334dcde6-0. INFO 03-01 18:41:13 [logger.py:42] Received request cmpl-8603d6c378f24377b6aa8bad736cc85b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:13 [async_llm.py:261] Added request cmpl-8603d6c378f24377b6aa8bad736cc85b-0. INFO 03-01 18:41:14 [logger.py:42] Received request cmpl-da1be11a441142d79ab5ee53d0237e42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:14 [async_llm.py:261] Added request cmpl-da1be11a441142d79ab5ee53d0237e42-0. INFO 03-01 18:41:15 [logger.py:42] Received request cmpl-81fd7a24bdde44bb970b479ad9650b03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:15 [async_llm.py:261] Added request cmpl-81fd7a24bdde44bb970b479ad9650b03-0. INFO 03-01 18:41:16 [logger.py:42] Received request cmpl-693c9957bc3949139e04caf29d120399-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:16 [async_llm.py:261] Added request cmpl-693c9957bc3949139e04caf29d120399-0. INFO 03-01 18:41:18 [logger.py:42] Received request cmpl-3041805df59541c4abf54fffbda8dedd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:18 [async_llm.py:261] Added request cmpl-3041805df59541c4abf54fffbda8dedd-0. INFO 03-01 18:41:19 [logger.py:42] Received request cmpl-9893c2e311ed41da84fdfaa8c1636654-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:19 [async_llm.py:261] Added request cmpl-9893c2e311ed41da84fdfaa8c1636654-0. INFO 03-01 18:41:20 [logger.py:42] Received request cmpl-7725295cd70048128525c7b33b17fbbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:20 [async_llm.py:261] Added request cmpl-7725295cd70048128525c7b33b17fbbd-0. INFO 03-01 18:41:21 [logger.py:42] Received request cmpl-5917016b11e2424a802d982821b5d248-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:21 [async_llm.py:261] Added request cmpl-5917016b11e2424a802d982821b5d248-0. INFO 03-01 18:41:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:41:22 [logger.py:42] Received request cmpl-1e5ad7ce8a4b4feb8f57507b3030ffac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:22 [async_llm.py:261] Added request cmpl-1e5ad7ce8a4b4feb8f57507b3030ffac-0. INFO 03-01 18:41:23 [logger.py:42] Received request cmpl-cf4cbde627094d658c1df827cb41f1a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:23 [async_llm.py:261] Added request cmpl-cf4cbde627094d658c1df827cb41f1a1-0. INFO 03-01 18:41:24 [logger.py:42] Received request cmpl-74a577a8d78643ad853775b5540ac321-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:24 [async_llm.py:261] Added request cmpl-74a577a8d78643ad853775b5540ac321-0. INFO 03-01 18:41:26 [logger.py:42] Received request cmpl-97ba193c9bcb45de97d489ada9f2a3f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:26 [async_llm.py:261] Added request cmpl-97ba193c9bcb45de97d489ada9f2a3f6-0. INFO 03-01 18:41:27 [logger.py:42] Received request cmpl-1f6b58f93b4b4de59bd0997c49636759-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:27 [async_llm.py:261] Added request cmpl-1f6b58f93b4b4de59bd0997c49636759-0. INFO 03-01 18:41:28 [logger.py:42] Received request cmpl-0ddb403a7fe44148bbb646260fb98861-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:28 [async_llm.py:261] Added request cmpl-0ddb403a7fe44148bbb646260fb98861-0. INFO 03-01 18:41:29 [logger.py:42] Received request cmpl-f7b5fffd153c435187c2da872281ecff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:29 [async_llm.py:261] Added request cmpl-f7b5fffd153c435187c2da872281ecff-0. INFO 03-01 18:41:30 [logger.py:42] Received request cmpl-84468443761648c09bcafcba9a364a67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:30 [async_llm.py:261] Added request cmpl-84468443761648c09bcafcba9a364a67-0. INFO 03-01 18:41:31 [logger.py:42] Received request cmpl-fd40772f51dc4f21986498734a6be42e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:31 [async_llm.py:261] Added request cmpl-fd40772f51dc4f21986498734a6be42e-0. INFO 03-01 18:41:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.5% INFO 03-01 18:41:33 [logger.py:42] Received request cmpl-4e401bfcf7e646ee9d7ae471e1982cf2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:33 [async_llm.py:261] Added request cmpl-4e401bfcf7e646ee9d7ae471e1982cf2-0. INFO 03-01 18:41:34 [logger.py:42] Received request cmpl-8842384b6c854c8f80a4a42a0026f797-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:34 [async_llm.py:261] Added request cmpl-8842384b6c854c8f80a4a42a0026f797-0. INFO 03-01 18:41:35 [logger.py:42] Received request cmpl-dcad0e0224fd4a4cb395a02d50d335ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:35 [async_llm.py:261] Added request cmpl-dcad0e0224fd4a4cb395a02d50d335ca-0. INFO 03-01 18:41:36 [logger.py:42] Received request cmpl-0bf71ba6c8834318a0b83ac0222ceea0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:36 [async_llm.py:261] Added request cmpl-0bf71ba6c8834318a0b83ac0222ceea0-0. INFO 03-01 18:41:37 [logger.py:42] Received request cmpl-d5fe545f1c74490e8814080675bea2ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:37 [async_llm.py:261] Added request cmpl-d5fe545f1c74490e8814080675bea2ca-0. INFO 03-01 18:41:38 [logger.py:42] Received request cmpl-c559f3aadf194a85a285ce4910200ffe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:38 [async_llm.py:261] Added request cmpl-c559f3aadf194a85a285ce4910200ffe-0. INFO 03-01 18:41:40 [logger.py:42] Received request cmpl-df83fa84bbaa49b9b661a9786a74c798-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:40 [async_llm.py:261] Added request cmpl-df83fa84bbaa49b9b661a9786a74c798-0. INFO 03-01 18:41:41 [logger.py:42] Received request cmpl-f5a719eff711410f8b8c29b00a06f9f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:41 [async_llm.py:261] Added request cmpl-f5a719eff711410f8b8c29b00a06f9f4-0. INFO 03-01 18:41:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:41:42 [logger.py:42] Received request cmpl-65f606d01c1e4ed886b9b3590130bb30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:42 [async_llm.py:261] Added request cmpl-65f606d01c1e4ed886b9b3590130bb30-0. INFO 03-01 18:41:43 [logger.py:42] Received request cmpl-f2c5cc89314b4c3faf720dc5628aba06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:43 [async_llm.py:261] Added request cmpl-f2c5cc89314b4c3faf720dc5628aba06-0. INFO 03-01 18:41:44 [logger.py:42] Received request cmpl-683451f17d374899b0a85c2aa8a1d9c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:44 [async_llm.py:261] Added request cmpl-683451f17d374899b0a85c2aa8a1d9c3-0. INFO 03-01 18:41:45 [logger.py:42] Received request cmpl-f46147e237d14656871bf05a716747ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:45 [async_llm.py:261] Added request cmpl-f46147e237d14656871bf05a716747ec-0. INFO 03-01 18:41:46 [logger.py:42] Received request cmpl-22a8c2049a5a4fa8b7119eb7712294b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:46 [async_llm.py:261] Added request cmpl-22a8c2049a5a4fa8b7119eb7712294b3-0. INFO 03-01 18:41:48 [logger.py:42] Received request cmpl-9730ee2c4b8b46fdb774744563b9a436-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:48 [async_llm.py:261] Added request cmpl-9730ee2c4b8b46fdb774744563b9a436-0. INFO 03-01 18:41:49 [logger.py:42] Received request cmpl-68879faa53084dc490fcb46f8f9c53ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:49 [async_llm.py:261] Added request cmpl-68879faa53084dc490fcb46f8f9c53ad-0. INFO 03-01 18:41:50 [logger.py:42] Received request cmpl-6ae303eb93b2427785e428e929dffd45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:50 [async_llm.py:261] Added request cmpl-6ae303eb93b2427785e428e929dffd45-0. INFO 03-01 18:41:51 [logger.py:42] Received request cmpl-4e1b535ea7584a9d84e6f37b8a47c1b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:51 [async_llm.py:261] Added request cmpl-4e1b535ea7584a9d84e6f37b8a47c1b2-0. INFO 03-01 18:41:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:41:52 [logger.py:42] Received request cmpl-3904d8641fd84266b38a687fe730a49b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:52 [async_llm.py:261] Added request cmpl-3904d8641fd84266b38a687fe730a49b-0. INFO 03-01 18:41:53 [logger.py:42] Received request cmpl-d1c8e844d2384564b35e3b1aaed6b79b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:53 [async_llm.py:261] Added request cmpl-d1c8e844d2384564b35e3b1aaed6b79b-0. INFO 03-01 18:41:55 [logger.py:42] Received request cmpl-868b2bf794994276a8073ec5530b5d2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:55 [async_llm.py:261] Added request cmpl-868b2bf794994276a8073ec5530b5d2e-0. INFO 03-01 18:41:56 [logger.py:42] Received request cmpl-43eb221f45b2444294a491f13cdb5d22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:56 [async_llm.py:261] Added request cmpl-43eb221f45b2444294a491f13cdb5d22-0. INFO 03-01 18:41:57 [logger.py:42] Received request cmpl-e31d4fb88a804f76a783e51fbf2cea52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:57 [async_llm.py:261] Added request cmpl-e31d4fb88a804f76a783e51fbf2cea52-0. INFO 03-01 18:41:58 [logger.py:42] Received request cmpl-a7577a0586f04e7f8583287cf9aaa941-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:58 [async_llm.py:261] Added request cmpl-a7577a0586f04e7f8583287cf9aaa941-0. INFO 03-01 18:41:59 [logger.py:42] Received request cmpl-20d8807fe59e437a89a026f194945769-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:41:59 [async_llm.py:261] Added request cmpl-20d8807fe59e437a89a026f194945769-0. INFO 03-01 18:42:00 [logger.py:42] Received request cmpl-beaebb0e8710490a9e649e4d683f99a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:00 [async_llm.py:261] Added request cmpl-beaebb0e8710490a9e649e4d683f99a4-0. INFO 03-01 18:42:02 [logger.py:42] Received request cmpl-8a44567362b64fcb99c04de2b25362ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:02 [async_llm.py:261] Added request cmpl-8a44567362b64fcb99c04de2b25362ac-0. INFO 03-01 18:42:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:42:03 [logger.py:42] Received request cmpl-b57d416019554654952ca10e578e4e3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:03 [async_llm.py:261] Added request cmpl-b57d416019554654952ca10e578e4e3c-0. INFO 03-01 18:42:04 [logger.py:42] Received request cmpl-f5f59b2f88bc409c94e05e1faf7aa23f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:04 [async_llm.py:261] Added request cmpl-f5f59b2f88bc409c94e05e1faf7aa23f-0. INFO 03-01 18:42:05 [logger.py:42] Received request cmpl-5a1e23e23f0f42cf99607e92b85ab3b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:05 [async_llm.py:261] Added request cmpl-5a1e23e23f0f42cf99607e92b85ab3b6-0. INFO 03-01 18:42:06 [logger.py:42] Received request cmpl-b6da7cca4cff48029f46cbebdd184300-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:06 [async_llm.py:261] Added request cmpl-b6da7cca4cff48029f46cbebdd184300-0. INFO 03-01 18:42:07 [logger.py:42] Received request cmpl-ab08ad1d9a4d4a998d2d1cd0a66666e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:07 [async_llm.py:261] Added request cmpl-ab08ad1d9a4d4a998d2d1cd0a66666e5-0. INFO 03-01 18:42:08 [logger.py:42] Received request cmpl-2b3b8845abf44a3ea01812dd9c0925a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:08 [async_llm.py:261] Added request cmpl-2b3b8845abf44a3ea01812dd9c0925a4-0. INFO 03-01 18:42:10 [logger.py:42] Received request cmpl-4dd26f63c6e44d0c8c83e88e5486fc04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:10 [async_llm.py:261] Added request cmpl-4dd26f63c6e44d0c8c83e88e5486fc04-0. INFO 03-01 18:42:11 [logger.py:42] Received request cmpl-7ab267590f684cebbbf5d5cb7cedf505-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:11 [async_llm.py:261] Added request cmpl-7ab267590f684cebbbf5d5cb7cedf505-0. INFO 03-01 18:42:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:42:12 [logger.py:42] Received request cmpl-fcf9dc5ea50d465082ca503e8ef81296-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:12 [async_llm.py:261] Added request cmpl-fcf9dc5ea50d465082ca503e8ef81296-0. INFO 03-01 18:42:13 [logger.py:42] Received request cmpl-6760dc7618494f0bbf1aa4a1838313b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:13 [async_llm.py:261] Added request cmpl-6760dc7618494f0bbf1aa4a1838313b3-0. INFO 03-01 18:42:14 [logger.py:42] Received request cmpl-95ccd8b48218465d8d85ed10a26b8183-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:14 [async_llm.py:261] Added request cmpl-95ccd8b48218465d8d85ed10a26b8183-0. INFO 03-01 18:42:15 [logger.py:42] Received request cmpl-3b361fd82ace4ba8a92e575925c6f318-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:15 [async_llm.py:261] Added request cmpl-3b361fd82ace4ba8a92e575925c6f318-0. INFO 03-01 18:42:17 [logger.py:42] Received request cmpl-14d24085416e4067b9449d38d9786fab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:17 [async_llm.py:261] Added request cmpl-14d24085416e4067b9449d38d9786fab-0. INFO 03-01 18:42:18 [logger.py:42] Received request cmpl-0ac3eefcb88d4951863f4e188931834d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:18 [async_llm.py:261] Added request cmpl-0ac3eefcb88d4951863f4e188931834d-0. INFO 03-01 18:42:19 [logger.py:42] Received request cmpl-282ef4a2aed14441abf40287071f5d14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:19 [async_llm.py:261] Added request cmpl-282ef4a2aed14441abf40287071f5d14-0. INFO 03-01 18:42:20 [logger.py:42] Received request cmpl-a76d2ff7fbf2460c9b9dc7b7268c1782-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:20 [async_llm.py:261] Added request cmpl-a76d2ff7fbf2460c9b9dc7b7268c1782-0. INFO 03-01 18:42:21 [logger.py:42] Received request cmpl-fa97bfa43d5e4913b9fe5f26b9858782-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:21 [async_llm.py:261] Added request cmpl-fa97bfa43d5e4913b9fe5f26b9858782-0. INFO 03-01 18:42:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:42:22 [logger.py:42] Received request cmpl-c8d0225f8d5a4f7fb4a81ef31b14a5e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:22 [async_llm.py:261] Added request cmpl-c8d0225f8d5a4f7fb4a81ef31b14a5e2-0. INFO 03-01 18:42:24 [logger.py:42] Received request cmpl-d44889e660a34162a090a115f5b6fac3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:24 [async_llm.py:261] Added request cmpl-d44889e660a34162a090a115f5b6fac3-0. INFO 03-01 18:42:25 [logger.py:42] Received request cmpl-88b9f655d9514dc1accafc87de68b8fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:25 [async_llm.py:261] Added request cmpl-88b9f655d9514dc1accafc87de68b8fd-0. INFO 03-01 18:42:26 [logger.py:42] Received request cmpl-454b180391474a488b128ec9403fc46b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:26 [async_llm.py:261] Added request cmpl-454b180391474a488b128ec9403fc46b-0. INFO 03-01 18:42:27 [logger.py:42] Received request cmpl-9f575d5035294dce874ac71064d85187-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:27 [async_llm.py:261] Added request cmpl-9f575d5035294dce874ac71064d85187-0. INFO 03-01 18:42:28 [logger.py:42] Received request cmpl-34ff23fde73d4d3baed9d4f1fc9da1ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:28 [async_llm.py:261] Added request cmpl-34ff23fde73d4d3baed9d4f1fc9da1ba-0. INFO 03-01 18:42:29 [logger.py:42] Received request cmpl-93e26b7e1bb34f9f893d17fb69b8e95a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:29 [async_llm.py:261] Added request cmpl-93e26b7e1bb34f9f893d17fb69b8e95a-0. INFO 03-01 18:42:31 [logger.py:42] Received request cmpl-e383515d31b946a3a16c418f9d67412a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:31 [async_llm.py:261] Added request cmpl-e383515d31b946a3a16c418f9d67412a-0. INFO 03-01 18:42:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:42:32 [logger.py:42] Received request cmpl-d3ee63089d98443bb42059bf645ab44b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:32 [async_llm.py:261] Added request cmpl-d3ee63089d98443bb42059bf645ab44b-0. INFO 03-01 18:42:33 [logger.py:42] Received request cmpl-8d796ba6df054469985ca573f571ad22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:33 [async_llm.py:261] Added request cmpl-8d796ba6df054469985ca573f571ad22-0. INFO 03-01 18:42:34 [logger.py:42] Received request cmpl-17c4110c193b4dcd9b593ae2a3d2cc2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:34 [async_llm.py:261] Added request cmpl-17c4110c193b4dcd9b593ae2a3d2cc2a-0. INFO 03-01 18:42:35 [logger.py:42] Received request cmpl-0ca4261399cc45c0bb481f362a27ec7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:35 [async_llm.py:261] Added request cmpl-0ca4261399cc45c0bb481f362a27ec7d-0. INFO 03-01 18:42:36 [logger.py:42] Received request cmpl-854a45623b5545eda7e9c03a5be691ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:36 [async_llm.py:261] Added request cmpl-854a45623b5545eda7e9c03a5be691ed-0. INFO 03-01 18:42:38 [logger.py:42] Received request cmpl-d98a21abd5274c8592d59087efdeffef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:38 [async_llm.py:261] Added request cmpl-d98a21abd5274c8592d59087efdeffef-0. INFO 03-01 18:42:39 [logger.py:42] Received request cmpl-c3fc55b2c2ae41fb974492e02f8141a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:39 [async_llm.py:261] Added request cmpl-c3fc55b2c2ae41fb974492e02f8141a9-0. INFO 03-01 18:42:40 [logger.py:42] Received request cmpl-16aed60e2c7f412c9bc8003c4640e46f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:40 [async_llm.py:261] Added request cmpl-16aed60e2c7f412c9bc8003c4640e46f-0. INFO 03-01 18:42:41 [logger.py:42] Received request cmpl-32b7769b829342c2bcfa3d42ef26ab22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:41 [async_llm.py:261] Added request cmpl-32b7769b829342c2bcfa3d42ef26ab22-0. INFO 03-01 18:42:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:42:42 [logger.py:42] Received request cmpl-e69bfdcf875a4fd5bc3aa58705bfd476-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:42 [async_llm.py:261] Added request cmpl-e69bfdcf875a4fd5bc3aa58705bfd476-0. INFO 03-01 18:42:44 [logger.py:42] Received request cmpl-496d6e525af94b8095df54a48093da24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:44 [async_llm.py:261] Added request cmpl-496d6e525af94b8095df54a48093da24-0. INFO 03-01 18:42:45 [logger.py:42] Received request cmpl-5f013d0c1c094daab7147d1a09be4b99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:45 [async_llm.py:261] Added request cmpl-5f013d0c1c094daab7147d1a09be4b99-0. INFO 03-01 18:42:46 [logger.py:42] Received request cmpl-d24fce39a2d74ceca9f46c0db803c095-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:46 [async_llm.py:261] Added request cmpl-d24fce39a2d74ceca9f46c0db803c095-0. INFO 03-01 18:42:47 [logger.py:42] Received request cmpl-b0263bd9fe5044869f90365c4f775c27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:47 [async_llm.py:261] Added request cmpl-b0263bd9fe5044869f90365c4f775c27-0. INFO 03-01 18:42:48 [logger.py:42] Received request cmpl-c3ca83bd1f0e4a4c82a7381cee2cbd06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:48 [async_llm.py:261] Added request cmpl-c3ca83bd1f0e4a4c82a7381cee2cbd06-0. INFO 03-01 18:42:49 [logger.py:42] Received request cmpl-9cdd284712114e489f134d6e3e5e6970-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:49 [async_llm.py:261] Added request cmpl-9cdd284712114e489f134d6e3e5e6970-0. INFO 03-01 18:42:50 [logger.py:42] Received request cmpl-5fc67e29c07c4dbfa4dd39837ad60d9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:50 [async_llm.py:261] Added request cmpl-5fc67e29c07c4dbfa4dd39837ad60d9c-0. INFO 03-01 18:42:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:42:52 [logger.py:42] Received request cmpl-dbb5975188584d489eca5ebc849445ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:52 [async_llm.py:261] Added request cmpl-dbb5975188584d489eca5ebc849445ec-0. INFO 03-01 18:42:53 [logger.py:42] Received request cmpl-7c929acbc21a40439225046c168dbb5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:53 [async_llm.py:261] Added request cmpl-7c929acbc21a40439225046c168dbb5f-0. INFO 03-01 18:42:54 [logger.py:42] Received request cmpl-5b4e8a5f0a7046f2a79cb482c03a9827-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:54 [async_llm.py:261] Added request cmpl-5b4e8a5f0a7046f2a79cb482c03a9827-0. INFO 03-01 18:42:55 [logger.py:42] Received request cmpl-0a5dce52f0414aea82c0290de583995e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:55 [async_llm.py:261] Added request cmpl-0a5dce52f0414aea82c0290de583995e-0. INFO 03-01 18:42:56 [logger.py:42] Received request cmpl-40be2e343d2d4cdc87d42cc723e6ea77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:56 [async_llm.py:261] Added request cmpl-40be2e343d2d4cdc87d42cc723e6ea77-0. INFO 03-01 18:42:57 [logger.py:42] Received request cmpl-7ffb9d574aa3416cad7042a511bee19b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:57 [async_llm.py:261] Added request cmpl-7ffb9d574aa3416cad7042a511bee19b-0. INFO 03-01 18:42:59 [logger.py:42] Received request cmpl-9f02a0f69a034faea1240e74903cba65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:42:59 [async_llm.py:261] Added request cmpl-9f02a0f69a034faea1240e74903cba65-0. INFO 03-01 18:43:00 [logger.py:42] Received request cmpl-bb84d7633e4b4adfbebd2cdc63eaf659-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:00 [async_llm.py:261] Added request cmpl-bb84d7633e4b4adfbebd2cdc63eaf659-0. INFO 03-01 18:43:01 [logger.py:42] Received request cmpl-332081816b3b475b934130374cf3f23c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:01 [async_llm.py:261] Added request cmpl-332081816b3b475b934130374cf3f23c-0. INFO 03-01 18:43:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:43:02 [logger.py:42] Received request cmpl-1a7d0f25ef9d4aa2bb49289509702fa5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:02 [async_llm.py:261] Added request cmpl-1a7d0f25ef9d4aa2bb49289509702fa5-0. INFO 03-01 18:43:03 [logger.py:42] Received request cmpl-bc4f931a2e5542d0bf4f007ae6374c5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:03 [async_llm.py:261] Added request cmpl-bc4f931a2e5542d0bf4f007ae6374c5d-0. INFO 03-01 18:43:04 [logger.py:42] Received request cmpl-e4e1057306e6483c94e5aefb26852550-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:04 [async_llm.py:261] Added request cmpl-e4e1057306e6483c94e5aefb26852550-0. INFO 03-01 18:43:06 [logger.py:42] Received request cmpl-c7fb010862e940808409488213154df3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:06 [async_llm.py:261] Added request cmpl-c7fb010862e940808409488213154df3-0. INFO 03-01 18:43:07 [logger.py:42] Received request cmpl-178664a70c0c4363b5b30e78f3582168-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:07 [async_llm.py:261] Added request cmpl-178664a70c0c4363b5b30e78f3582168-0. INFO 03-01 18:43:08 [logger.py:42] Received request cmpl-3282718967024d5cb2a79fc8f702e844-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:08 [async_llm.py:261] Added request cmpl-3282718967024d5cb2a79fc8f702e844-0. INFO 03-01 18:43:09 [logger.py:42] Received request cmpl-43f73ff111804357abc89b800629f8d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:09 [async_llm.py:261] Added request cmpl-43f73ff111804357abc89b800629f8d0-0. INFO 03-01 18:43:10 [logger.py:42] Received request cmpl-10df0658cbd44beeb22a754bf08dd824-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:10 [async_llm.py:261] Added request cmpl-10df0658cbd44beeb22a754bf08dd824-0. INFO 03-01 18:43:11 [logger.py:42] Received request cmpl-e8a1a7bdd8b24c0bac00a4c6e086c823-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:11 [async_llm.py:261] Added request cmpl-e8a1a7bdd8b24c0bac00a4c6e086c823-0. INFO 03-01 18:43:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:43:13 [logger.py:42] Received request cmpl-4752220242434657bad204f129555be2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:13 [async_llm.py:261] Added request cmpl-4752220242434657bad204f129555be2-0. INFO 03-01 18:43:14 [logger.py:42] Received request cmpl-8f9ae99ab0d1437eae0f797751cce1ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:14 [async_llm.py:261] Added request cmpl-8f9ae99ab0d1437eae0f797751cce1ff-0. INFO 03-01 18:43:15 [logger.py:42] Received request cmpl-3c60ad86e95641ddbf9f964152371866-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:15 [async_llm.py:261] Added request cmpl-3c60ad86e95641ddbf9f964152371866-0. INFO 03-01 18:43:16 [logger.py:42] Received request cmpl-f6b1510372564e58a47959c5faf7d409-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:16 [async_llm.py:261] Added request cmpl-f6b1510372564e58a47959c5faf7d409-0. INFO 03-01 18:43:17 [logger.py:42] Received request cmpl-33874ee20db740eea6ab78d0d69f57fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:17 [async_llm.py:261] Added request cmpl-33874ee20db740eea6ab78d0d69f57fc-0. INFO 03-01 18:43:18 [logger.py:42] Received request cmpl-451747eeb1bd4de2932a2f204f49205d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:18 [async_llm.py:261] Added request cmpl-451747eeb1bd4de2932a2f204f49205d-0. INFO 03-01 18:43:19 [logger.py:42] Received request cmpl-5b8ca29c5fc04a18b3349525a5945744-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:19 [async_llm.py:261] Added request cmpl-5b8ca29c5fc04a18b3349525a5945744-0. INFO 03-01 18:43:21 [logger.py:42] Received request cmpl-94ecb9f2e50a41bd8342ffc6e8bf174e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:21 [async_llm.py:261] Added request cmpl-94ecb9f2e50a41bd8342ffc6e8bf174e-0. INFO 03-01 18:43:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:43:22 [logger.py:42] Received request cmpl-fa0c6b0ebda4478482945e31c3a52cc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:22 [async_llm.py:261] Added request cmpl-fa0c6b0ebda4478482945e31c3a52cc3-0. INFO 03-01 18:43:23 [logger.py:42] Received request cmpl-7b1d3ccda23242a98e181cf55150af98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:23 [async_llm.py:261] Added request cmpl-7b1d3ccda23242a98e181cf55150af98-0. INFO 03-01 18:43:24 [logger.py:42] Received request cmpl-5f94f615fc1b403285be6e108f518815-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:24 [async_llm.py:261] Added request cmpl-5f94f615fc1b403285be6e108f518815-0. INFO 03-01 18:43:25 [logger.py:42] Received request cmpl-3141bfe19b024bcb8a6c19811d7030f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:25 [async_llm.py:261] Added request cmpl-3141bfe19b024bcb8a6c19811d7030f6-0. INFO 03-01 18:43:26 [logger.py:42] Received request cmpl-b22d1ce447bf461cb814e0640e0c4b8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:26 [async_llm.py:261] Added request cmpl-b22d1ce447bf461cb814e0640e0c4b8d-0. INFO 03-01 18:43:28 [logger.py:42] Received request cmpl-05a3dec2a6e9437ea29b6b2bc995c5c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:28 [async_llm.py:261] Added request cmpl-05a3dec2a6e9437ea29b6b2bc995c5c7-0. INFO 03-01 18:43:29 [logger.py:42] Received request cmpl-1963aa55ad3941a9b7d65f206af8d95c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:29 [async_llm.py:261] Added request cmpl-1963aa55ad3941a9b7d65f206af8d95c-0. INFO 03-01 18:43:30 [logger.py:42] Received request cmpl-a92bd3597839436db78a64fc6f733704-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:30 [async_llm.py:261] Added request cmpl-a92bd3597839436db78a64fc6f733704-0. INFO 03-01 18:43:31 [logger.py:42] Received request cmpl-61fbbbd2cd954c95ad1c7f9c15b7e925-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:31 [async_llm.py:261] Added request cmpl-61fbbbd2cd954c95ad1c7f9c15b7e925-0. INFO 03-01 18:43:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:43:32 [logger.py:42] Received request cmpl-3a6d02ecf10149baaad43dbb83cc7ac2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:32 [async_llm.py:261] Added request cmpl-3a6d02ecf10149baaad43dbb83cc7ac2-0. INFO 03-01 18:43:33 [logger.py:42] Received request cmpl-ea6013dd00cd495ab6202c8b0d87b2cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:33 [async_llm.py:261] Added request cmpl-ea6013dd00cd495ab6202c8b0d87b2cd-0. INFO 03-01 18:43:34 [logger.py:42] Received request cmpl-b4522118f97e42b1adf41afbf8370eef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:34 [async_llm.py:261] Added request cmpl-b4522118f97e42b1adf41afbf8370eef-0. INFO 03-01 18:43:36 [logger.py:42] Received request cmpl-78e06358f02a49e189dcf15f0e478992-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:36 [async_llm.py:261] Added request cmpl-78e06358f02a49e189dcf15f0e478992-0. INFO 03-01 18:43:37 [logger.py:42] Received request cmpl-7b7163b398c14d4bbc7a093c95cfc076-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:37 [async_llm.py:261] Added request cmpl-7b7163b398c14d4bbc7a093c95cfc076-0. INFO 03-01 18:43:38 [logger.py:42] Received request cmpl-f37f8b73bfe54a3bbd938f264513b4bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:38 [async_llm.py:261] Added request cmpl-f37f8b73bfe54a3bbd938f264513b4bf-0. INFO 03-01 18:43:39 [logger.py:42] Received request cmpl-3416c734f28a4845a9ca15992f746243-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:39 [async_llm.py:261] Added request cmpl-3416c734f28a4845a9ca15992f746243-0. INFO 03-01 18:43:40 [logger.py:42] Received request cmpl-2dfa54ec1ffa41f38cf5b25f7034f9cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:40 [async_llm.py:261] Added request cmpl-2dfa54ec1ffa41f38cf5b25f7034f9cd-0. INFO 03-01 18:43:41 [logger.py:42] Received request cmpl-41b5c955340c406595584ad8c2a08c9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:41 [async_llm.py:261] Added request cmpl-41b5c955340c406595584ad8c2a08c9b-0. INFO 03-01 18:43:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.5% INFO 03-01 18:43:43 [logger.py:42] Received request cmpl-d7a21e6d9218466db486e9f2cd88a3d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:43 [async_llm.py:261] Added request cmpl-d7a21e6d9218466db486e9f2cd88a3d3-0. INFO 03-01 18:43:44 [logger.py:42] Received request cmpl-c5247beae36846f88b6ce0dccbc2ebea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:44 [async_llm.py:261] Added request cmpl-c5247beae36846f88b6ce0dccbc2ebea-0. INFO 03-01 18:43:45 [logger.py:42] Received request cmpl-44d1a85bdce84961819bc2db89e0d454-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:45 [async_llm.py:261] Added request cmpl-44d1a85bdce84961819bc2db89e0d454-0. INFO 03-01 18:43:46 [logger.py:42] Received request cmpl-d426163306d043e785cd277c1942a339-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:46 [async_llm.py:261] Added request cmpl-d426163306d043e785cd277c1942a339-0. INFO 03-01 18:43:47 [logger.py:42] Received request cmpl-5e388f99aca740c1a201f99533d4b4ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:47 [async_llm.py:261] Added request cmpl-5e388f99aca740c1a201f99533d4b4ab-0. INFO 03-01 18:43:48 [logger.py:42] Received request cmpl-6e51282a1ac74488b2d0b219ea450a0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:48 [async_llm.py:261] Added request cmpl-6e51282a1ac74488b2d0b219ea450a0a-0. INFO 03-01 18:43:50 [logger.py:42] Received request cmpl-b4e17d6181f84c7d9a6f29d16c72043b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:50 [async_llm.py:261] Added request cmpl-b4e17d6181f84c7d9a6f29d16c72043b-0. INFO 03-01 18:43:51 [logger.py:42] Received request cmpl-5806f13124e8423ebd3bb80f03807616-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:51 [async_llm.py:261] Added request cmpl-5806f13124e8423ebd3bb80f03807616-0. INFO 03-01 18:43:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:43:52 [logger.py:42] Received request cmpl-a90eb68fe038408a8d52114d7c7196c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:52 [async_llm.py:261] Added request cmpl-a90eb68fe038408a8d52114d7c7196c1-0. INFO 03-01 18:43:53 [logger.py:42] Received request cmpl-8cb64df7b38b4d14aa710f224d2dc91a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:53 [async_llm.py:261] Added request cmpl-8cb64df7b38b4d14aa710f224d2dc91a-0. INFO 03-01 18:43:54 [logger.py:42] Received request cmpl-275c70bee0b249fc8c454ac9187d72cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:54 [async_llm.py:261] Added request cmpl-275c70bee0b249fc8c454ac9187d72cf-0. INFO 03-01 18:43:55 [logger.py:42] Received request cmpl-7206e6d82564415f92e0229fc9f55689-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:55 [async_llm.py:261] Added request cmpl-7206e6d82564415f92e0229fc9f55689-0. INFO 03-01 18:43:56 [logger.py:42] Received request cmpl-95dd6874722b47f29e084c8bf73c6ff9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:56 [async_llm.py:261] Added request cmpl-95dd6874722b47f29e084c8bf73c6ff9-0. INFO 03-01 18:43:58 [logger.py:42] Received request cmpl-cd28b185fe5c4f85958d490144e40f68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:58 [async_llm.py:261] Added request cmpl-cd28b185fe5c4f85958d490144e40f68-0. INFO 03-01 18:43:59 [logger.py:42] Received request cmpl-cadb42bede3e46d8a27800e857a2bcc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:43:59 [async_llm.py:261] Added request cmpl-cadb42bede3e46d8a27800e857a2bcc3-0. INFO 03-01 18:44:00 [logger.py:42] Received request cmpl-9cccc6a3477a474fa107ccde85014ed2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:00 [async_llm.py:261] Added request cmpl-9cccc6a3477a474fa107ccde85014ed2-0. INFO 03-01 18:44:01 [logger.py:42] Received request cmpl-b6c42bd3ce9d48cc8d2fea9451998cb8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:01 [async_llm.py:261] Added request cmpl-b6c42bd3ce9d48cc8d2fea9451998cb8-0. INFO 03-01 18:44:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:44:02 [logger.py:42] Received request cmpl-c1fe21279fb743159fa4c5c22c5d73f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:02 [async_llm.py:261] Added request cmpl-c1fe21279fb743159fa4c5c22c5d73f6-0. INFO 03-01 18:44:03 [logger.py:42] Received request cmpl-6cbb9ebce6d2415989c9ac1941990f46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:03 [async_llm.py:261] Added request cmpl-6cbb9ebce6d2415989c9ac1941990f46-0. INFO 03-01 18:44:05 [logger.py:42] Received request cmpl-c2dd07a4db8d427ab61d5a930f858296-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:05 [async_llm.py:261] Added request cmpl-c2dd07a4db8d427ab61d5a930f858296-0. INFO 03-01 18:44:06 [logger.py:42] Received request cmpl-a08ab4e9170a4405b49a5c15e00f6ed5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:06 [async_llm.py:261] Added request cmpl-a08ab4e9170a4405b49a5c15e00f6ed5-0. INFO 03-01 18:44:07 [logger.py:42] Received request cmpl-49b16ae95b9b4428a7b03fb0a09a9865-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:07 [async_llm.py:261] Added request cmpl-49b16ae95b9b4428a7b03fb0a09a9865-0. INFO 03-01 18:44:08 [logger.py:42] Received request cmpl-dd0393bb6b874d1c9b8f79e8728043e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:08 [async_llm.py:261] Added request cmpl-dd0393bb6b874d1c9b8f79e8728043e0-0. INFO 03-01 18:44:09 [logger.py:42] Received request cmpl-125703c1f8af4e0eb810d515b80a3813-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:09 [async_llm.py:261] Added request cmpl-125703c1f8af4e0eb810d515b80a3813-0. INFO 03-01 18:44:10 [logger.py:42] Received request cmpl-e85fa9c243cf4e85b27663ed49822c61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:10 [async_llm.py:261] Added request cmpl-e85fa9c243cf4e85b27663ed49822c61-0. INFO 03-01 18:44:12 [logger.py:42] Received request cmpl-c8dcfba7b4054e91aa06d3cbdbe445d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:12 [async_llm.py:261] Added request cmpl-c8dcfba7b4054e91aa06d3cbdbe445d1-0. INFO 03-01 18:44:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:44:13 [logger.py:42] Received request cmpl-9045a4127cef4d0097017d8229ad5942-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:13 [async_llm.py:261] Added request cmpl-9045a4127cef4d0097017d8229ad5942-0. INFO 03-01 18:44:14 [logger.py:42] Received request cmpl-7dbd14041a5841cf84cf85b6e2e969cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:14 [async_llm.py:261] Added request cmpl-7dbd14041a5841cf84cf85b6e2e969cc-0. INFO 03-01 18:44:15 [logger.py:42] Received request cmpl-a499486129bc4e24b537f11f07827e37-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:15 [async_llm.py:261] Added request cmpl-a499486129bc4e24b537f11f07827e37-0. INFO 03-01 18:44:16 [logger.py:42] Received request cmpl-a8f5a0bad3ac4374ab94f6d2aff62869-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:16 [async_llm.py:261] Added request cmpl-a8f5a0bad3ac4374ab94f6d2aff62869-0. INFO 03-01 18:44:17 [logger.py:42] Received request cmpl-0c8c6d43ad984327b822a4261579ab56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:17 [async_llm.py:261] Added request cmpl-0c8c6d43ad984327b822a4261579ab56-0. INFO 03-01 18:44:18 [logger.py:42] Received request cmpl-a5d87e2a0d754a1095913400b7eb117b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:18 [async_llm.py:261] Added request cmpl-a5d87e2a0d754a1095913400b7eb117b-0. INFO 03-01 18:44:20 [logger.py:42] Received request cmpl-9f66fa6f3a214c6196663b8d096f643d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:20 [async_llm.py:261] Added request cmpl-9f66fa6f3a214c6196663b8d096f643d-0. INFO 03-01 18:44:21 [logger.py:42] Received request cmpl-6b5a2c9541764f4280141b895f9647fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:21 [async_llm.py:261] Added request cmpl-6b5a2c9541764f4280141b895f9647fb-0. INFO 03-01 18:44:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:44:22 [logger.py:42] Received request cmpl-71c508c26f5741b794a8e375b23179d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:22 [async_llm.py:261] Added request cmpl-71c508c26f5741b794a8e375b23179d4-0. INFO 03-01 18:44:23 [logger.py:42] Received request cmpl-af92da8cbdef432ab86313c8fd46e432-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:23 [async_llm.py:261] Added request cmpl-af92da8cbdef432ab86313c8fd46e432-0. INFO 03-01 18:44:24 [logger.py:42] Received request cmpl-53a6fc178f6843699828de9a97b29cde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:24 [async_llm.py:261] Added request cmpl-53a6fc178f6843699828de9a97b29cde-0. INFO 03-01 18:44:25 [logger.py:42] Received request cmpl-770ace23b66b4662a5767c30c8f73c2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:25 [async_llm.py:261] Added request cmpl-770ace23b66b4662a5767c30c8f73c2f-0. INFO 03-01 18:44:27 [logger.py:42] Received request cmpl-3bf25171ce994997bf3f4dca66c8f762-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:27 [async_llm.py:261] Added request cmpl-3bf25171ce994997bf3f4dca66c8f762-0. INFO 03-01 18:44:28 [logger.py:42] Received request cmpl-4fcbcff608044728bf21792ac2f9e8e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:28 [async_llm.py:261] Added request cmpl-4fcbcff608044728bf21792ac2f9e8e0-0. INFO 03-01 18:44:29 [logger.py:42] Received request cmpl-977782382ea448ff8107354ff667414d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:29 [async_llm.py:261] Added request cmpl-977782382ea448ff8107354ff667414d-0. INFO 03-01 18:44:30 [logger.py:42] Received request cmpl-9ff50fba6ea946738210488f3ed3f3ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:30 [async_llm.py:261] Added request cmpl-9ff50fba6ea946738210488f3ed3f3ec-0. INFO 03-01 18:44:31 [logger.py:42] Received request cmpl-6d1e70de57bf48aca982a89d4a631fbb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:31 [async_llm.py:261] Added request cmpl-6d1e70de57bf48aca982a89d4a631fbb-0. INFO 03-01 18:44:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.5% INFO 03-01 18:44:33 [logger.py:42] Received request cmpl-3f4592240f0f43e192d0bc86c74ef475-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:33 [async_llm.py:261] Added request cmpl-3f4592240f0f43e192d0bc86c74ef475-0. INFO 03-01 18:44:34 [logger.py:42] Received request cmpl-59eb047ebcab4353aa7bcd54f8b3e1d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:34 [async_llm.py:261] Added request cmpl-59eb047ebcab4353aa7bcd54f8b3e1d3-0. INFO 03-01 18:44:35 [logger.py:42] Received request cmpl-846f3fd5eedb47db826c5103ff1bc8e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:35 [async_llm.py:261] Added request cmpl-846f3fd5eedb47db826c5103ff1bc8e1-0. INFO 03-01 18:44:36 [logger.py:42] Received request cmpl-15d99f6c9a2d438990ff3dc958499e38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:36 [async_llm.py:261] Added request cmpl-15d99f6c9a2d438990ff3dc958499e38-0. INFO 03-01 18:44:37 [logger.py:42] Received request cmpl-ab9a1c5da2324787941fcce84b05a3a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:37 [async_llm.py:261] Added request cmpl-ab9a1c5da2324787941fcce84b05a3a0-0. INFO 03-01 18:44:38 [logger.py:42] Received request cmpl-72870dba862f461a9249b5ffa714e78b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:38 [async_llm.py:261] Added request cmpl-72870dba862f461a9249b5ffa714e78b-0. INFO 03-01 18:44:40 [logger.py:42] Received request cmpl-8bd168aa1a3d431ea36fb93ba5744931-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:40 [async_llm.py:261] Added request cmpl-8bd168aa1a3d431ea36fb93ba5744931-0. INFO 03-01 18:44:41 [logger.py:42] Received request cmpl-34759fa8653241adbf9f2bc89b97f1e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:41 [async_llm.py:261] Added request cmpl-34759fa8653241adbf9f2bc89b97f1e7-0. INFO 03-01 18:44:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:44:42 [logger.py:42] Received request cmpl-4fae99142ce840daad7bf4b167bbc22b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:42 [async_llm.py:261] Added request cmpl-4fae99142ce840daad7bf4b167bbc22b-0. INFO 03-01 18:44:43 [logger.py:42] Received request cmpl-72522396038f43edbbd20fd71d265712-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:43 [async_llm.py:261] Added request cmpl-72522396038f43edbbd20fd71d265712-0. INFO 03-01 18:44:44 [logger.py:42] Received request cmpl-1305cb0b203d4877a6de7fbaef1cc33b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:44 [async_llm.py:261] Added request cmpl-1305cb0b203d4877a6de7fbaef1cc33b-0. INFO 03-01 18:44:45 [logger.py:42] Received request cmpl-e75dfd45f2f549dfb6011196ca09fb35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:45 [async_llm.py:261] Added request cmpl-e75dfd45f2f549dfb6011196ca09fb35-0. INFO 03-01 18:44:46 [logger.py:42] Received request cmpl-393261d5c32a4345ae7a9ac5cc57d9a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:46 [async_llm.py:261] Added request cmpl-393261d5c32a4345ae7a9ac5cc57d9a4-0. INFO 03-01 18:44:48 [logger.py:42] Received request cmpl-b82c91865bca42a98d48b65c2d0f543c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:48 [async_llm.py:261] Added request cmpl-b82c91865bca42a98d48b65c2d0f543c-0. INFO 03-01 18:44:49 [logger.py:42] Received request cmpl-864328f54e784201afebf8e8194024a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:49 [async_llm.py:261] Added request cmpl-864328f54e784201afebf8e8194024a7-0. INFO 03-01 18:44:50 [logger.py:42] Received request cmpl-466b1a2773744bf0af3a82724a79bcf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:50 [async_llm.py:261] Added request cmpl-466b1a2773744bf0af3a82724a79bcf0-0. INFO 03-01 18:44:51 [logger.py:42] Received request cmpl-5ae6deba1d014c8d82830f5dcfd0e783-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:51 [async_llm.py:261] Added request cmpl-5ae6deba1d014c8d82830f5dcfd0e783-0. INFO 03-01 18:44:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:44:52 [logger.py:42] Received request cmpl-e805a844bc794efb872d8117fcc0e2ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:52 [async_llm.py:261] Added request cmpl-e805a844bc794efb872d8117fcc0e2ca-0. INFO 03-01 18:44:53 [logger.py:42] Received request cmpl-56c47d0bcaef4e17900b7fbd448e527e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:53 [async_llm.py:261] Added request cmpl-56c47d0bcaef4e17900b7fbd448e527e-0. INFO 03-01 18:44:55 [logger.py:42] Received request cmpl-824e7cd9917c42d89ef66291ac6c4680-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:55 [async_llm.py:261] Added request cmpl-824e7cd9917c42d89ef66291ac6c4680-0. INFO 03-01 18:44:56 [logger.py:42] Received request cmpl-edd4f16674d04dd0be0ae376996461ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:56 [async_llm.py:261] Added request cmpl-edd4f16674d04dd0be0ae376996461ab-0. INFO 03-01 18:44:57 [logger.py:42] Received request cmpl-451094143d5f4bcda2030dabaad91ab7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:57 [async_llm.py:261] Added request cmpl-451094143d5f4bcda2030dabaad91ab7-0. INFO 03-01 18:44:58 [logger.py:42] Received request cmpl-7546ba51cb27437eb39f802c70b0f7dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:58 [async_llm.py:261] Added request cmpl-7546ba51cb27437eb39f802c70b0f7dc-0. INFO 03-01 18:44:59 [logger.py:42] Received request cmpl-b4ed00e4129c4dc1b4f2dddca8605ac1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:44:59 [async_llm.py:261] Added request cmpl-b4ed00e4129c4dc1b4f2dddca8605ac1-0. INFO 03-01 18:45:00 [logger.py:42] Received request cmpl-9df5a8dcae7d4201b7a83fa5e6bb9282-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:00 [async_llm.py:261] Added request cmpl-9df5a8dcae7d4201b7a83fa5e6bb9282-0. INFO 03-01 18:45:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:45:02 [logger.py:42] Received request cmpl-e19f3d3a9f85480ba5de83f3ccaed1d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:02 [async_llm.py:261] Added request cmpl-e19f3d3a9f85480ba5de83f3ccaed1d1-0. INFO 03-01 18:45:03 [logger.py:42] Received request cmpl-afa50ed3625e42b0a4be667ffd4991ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:03 [async_llm.py:261] Added request cmpl-afa50ed3625e42b0a4be667ffd4991ad-0. INFO 03-01 18:45:04 [logger.py:42] Received request cmpl-4c69cb43725b4751af7561ab5a38f6ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:04 [async_llm.py:261] Added request cmpl-4c69cb43725b4751af7561ab5a38f6ca-0. INFO 03-01 18:45:05 [logger.py:42] Received request cmpl-0685ffcb3e9d4cfc8b926b1d81de66bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:05 [async_llm.py:261] Added request cmpl-0685ffcb3e9d4cfc8b926b1d81de66bf-0. INFO 03-01 18:45:06 [logger.py:42] Received request cmpl-250178ec75ec4a298fd866490cb3cfbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:06 [async_llm.py:261] Added request cmpl-250178ec75ec4a298fd866490cb3cfbe-0. INFO 03-01 18:45:07 [logger.py:42] Received request cmpl-4484aa5b7d164ac39787c40cebb17387-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:07 [async_llm.py:261] Added request cmpl-4484aa5b7d164ac39787c40cebb17387-0. INFO 03-01 18:45:09 [logger.py:42] Received request cmpl-75c6396f0e194d5988a5150734b443d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:09 [async_llm.py:261] Added request cmpl-75c6396f0e194d5988a5150734b443d8-0. INFO 03-01 18:45:10 [logger.py:42] Received request cmpl-a836f388a55244cca8f60f7813ec9451-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:10 [async_llm.py:261] Added request cmpl-a836f388a55244cca8f60f7813ec9451-0. INFO 03-01 18:45:11 [logger.py:42] Received request cmpl-cd7d981fdf4341a5ab2de8ddf2a0460e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:11 [async_llm.py:261] Added request cmpl-cd7d981fdf4341a5ab2de8ddf2a0460e-0. INFO 03-01 18:45:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:45:12 [logger.py:42] Received request cmpl-4981c2c98bcb49e9a74b770264c20a9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:12 [async_llm.py:261] Added request cmpl-4981c2c98bcb49e9a74b770264c20a9b-0. INFO 03-01 18:45:13 [logger.py:42] Received request cmpl-9dd308f7295a4357b742c3d1f3862368-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:13 [async_llm.py:261] Added request cmpl-9dd308f7295a4357b742c3d1f3862368-0. INFO 03-01 18:45:14 [logger.py:42] Received request cmpl-05d34b786be74a9e933aef33f0a66398-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:14 [async_llm.py:261] Added request cmpl-05d34b786be74a9e933aef33f0a66398-0. INFO 03-01 18:45:15 [logger.py:42] Received request cmpl-e67a1ee8f95a40e0a2defba23c4540b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:15 [async_llm.py:261] Added request cmpl-e67a1ee8f95a40e0a2defba23c4540b0-0. INFO 03-01 18:45:17 [logger.py:42] Received request cmpl-8c39f9e055544b638b7985fab797e0d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:17 [async_llm.py:261] Added request cmpl-8c39f9e055544b638b7985fab797e0d2-0. INFO 03-01 18:45:18 [logger.py:42] Received request cmpl-a0417106122541a69358d744ff6769a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:18 [async_llm.py:261] Added request cmpl-a0417106122541a69358d744ff6769a6-0. INFO 03-01 18:45:19 [logger.py:42] Received request cmpl-0807a1704279471bae4a2019cfe9fb8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:19 [async_llm.py:261] Added request cmpl-0807a1704279471bae4a2019cfe9fb8b-0. INFO 03-01 18:45:20 [logger.py:42] Received request cmpl-a02b2ad6017b4509a0e7dbd25e98bad0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:20 [async_llm.py:261] Added request cmpl-a02b2ad6017b4509a0e7dbd25e98bad0-0. INFO 03-01 18:45:21 [logger.py:42] Received request cmpl-514f7cbcb9e14a86a9ec16a25c233978-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:21 [async_llm.py:261] Added request cmpl-514f7cbcb9e14a86a9ec16a25c233978-0. INFO 03-01 18:45:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:45:22 [logger.py:42] Received request cmpl-89efda2d69a741188b0f6d218c42be02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:22 [async_llm.py:261] Added request cmpl-89efda2d69a741188b0f6d218c42be02-0. INFO 03-01 18:45:24 [logger.py:42] Received request cmpl-9a6fc44ab90e455988aa5f7008c56381-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:24 [async_llm.py:261] Added request cmpl-9a6fc44ab90e455988aa5f7008c56381-0. INFO 03-01 18:45:25 [logger.py:42] Received request cmpl-f7ff487ff77642c38363becd425bb9f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:25 [async_llm.py:261] Added request cmpl-f7ff487ff77642c38363becd425bb9f6-0. INFO 03-01 18:45:26 [logger.py:42] Received request cmpl-49be9b63aed7435894a30d30e54f9efe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:26 [async_llm.py:261] Added request cmpl-49be9b63aed7435894a30d30e54f9efe-0. INFO 03-01 18:45:27 [logger.py:42] Received request cmpl-7d1628347f9a47869628ea0dd85a6879-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:27 [async_llm.py:261] Added request cmpl-7d1628347f9a47869628ea0dd85a6879-0. INFO 03-01 18:45:28 [logger.py:42] Received request cmpl-435982aa14684725a761be16e128da77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:28 [async_llm.py:261] Added request cmpl-435982aa14684725a761be16e128da77-0. INFO 03-01 18:45:29 [logger.py:42] Received request cmpl-dd11b79314d34070b37c0f2974d13242-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:29 [async_llm.py:261] Added request cmpl-dd11b79314d34070b37c0f2974d13242-0. INFO 03-01 18:45:30 [logger.py:42] Received request cmpl-31b12c2338914e6c8dbf5fb3388085ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:30 [async_llm.py:261] Added request cmpl-31b12c2338914e6c8dbf5fb3388085ae-0. INFO 03-01 18:45:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:45:32 [logger.py:42] Received request cmpl-062030030e7c4acf9710dd5012edd3f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:32 [async_llm.py:261] Added request cmpl-062030030e7c4acf9710dd5012edd3f9-0. INFO 03-01 18:45:33 [logger.py:42] Received request cmpl-c4bd8f933eae461bbee1fff8fdaceb4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:33 [async_llm.py:261] Added request cmpl-c4bd8f933eae461bbee1fff8fdaceb4d-0. INFO 03-01 18:45:34 [logger.py:42] Received request cmpl-b6c703a20337459a89713a970ecc0b43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:34 [async_llm.py:261] Added request cmpl-b6c703a20337459a89713a970ecc0b43-0. INFO 03-01 18:45:35 [logger.py:42] Received request cmpl-a01e21cfb26e48f6a9886a2b1214de50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:35 [async_llm.py:261] Added request cmpl-a01e21cfb26e48f6a9886a2b1214de50-0. INFO 03-01 18:45:36 [logger.py:42] Received request cmpl-8424ff0f7ea04865a7fedbd53f036aa8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:36 [async_llm.py:261] Added request cmpl-8424ff0f7ea04865a7fedbd53f036aa8-0. INFO 03-01 18:45:37 [logger.py:42] Received request cmpl-ea8b0cfe3d4549469d48e2140d3b9f31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:37 [async_llm.py:261] Added request cmpl-ea8b0cfe3d4549469d48e2140d3b9f31-0. INFO 03-01 18:45:39 [logger.py:42] Received request cmpl-6ab8ddbf65d54fdfa1648952952dd9f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:39 [async_llm.py:261] Added request cmpl-6ab8ddbf65d54fdfa1648952952dd9f7-0. INFO 03-01 18:45:40 [logger.py:42] Received request cmpl-4a3fc0fdb34447bfbd4d87a8405ee82e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:40 [async_llm.py:261] Added request cmpl-4a3fc0fdb34447bfbd4d87a8405ee82e-0. INFO 03-01 18:45:41 [logger.py:42] Received request cmpl-cc5807bf2b674a8eb147baf8cd9b7741-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:41 [async_llm.py:261] Added request cmpl-cc5807bf2b674a8eb147baf8cd9b7741-0. INFO 03-01 18:45:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:45:42 [logger.py:42] Received request cmpl-c4a534ac385d4d188236e6ce21060f0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:42 [async_llm.py:261] Added request cmpl-c4a534ac385d4d188236e6ce21060f0f-0. INFO 03-01 18:45:43 [logger.py:42] Received request cmpl-f7328e339dc342928241582aedc3f817-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:43 [async_llm.py:261] Added request cmpl-f7328e339dc342928241582aedc3f817-0. INFO 03-01 18:45:45 [logger.py:42] Received request cmpl-99e0055c225c49b582e84de42abe6b3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:45 [async_llm.py:261] Added request cmpl-99e0055c225c49b582e84de42abe6b3b-0. INFO 03-01 18:45:46 [logger.py:42] Received request cmpl-fa40ae0d309946e3872ec30dc5d2de6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:46 [async_llm.py:261] Added request cmpl-fa40ae0d309946e3872ec30dc5d2de6d-0. INFO 03-01 18:45:47 [logger.py:42] Received request cmpl-1dc789338ccc45feae4089c23608a39d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:47 [async_llm.py:261] Added request cmpl-1dc789338ccc45feae4089c23608a39d-0. INFO 03-01 18:45:48 [logger.py:42] Received request cmpl-65f79291263340509cb8f427489af882-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:48 [async_llm.py:261] Added request cmpl-65f79291263340509cb8f427489af882-0. INFO 03-01 18:45:49 [logger.py:42] Received request cmpl-beaf7a959c1445b8b864fc659794f9ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:49 [async_llm.py:261] Added request cmpl-beaf7a959c1445b8b864fc659794f9ce-0. INFO 03-01 18:45:50 [logger.py:42] Received request cmpl-b7466f7be6c6446f9ab005ddb0034a6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:50 [async_llm.py:261] Added request cmpl-b7466f7be6c6446f9ab005ddb0034a6b-0. INFO 03-01 18:45:52 [logger.py:42] Received request cmpl-f125cf088fbc422ababa364ff414c196-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:52 [async_llm.py:261] Added request cmpl-f125cf088fbc422ababa364ff414c196-0. INFO 03-01 18:45:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:45:53 [logger.py:42] Received request cmpl-5e47cf4d740b4478bed6b3013dba6d27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:53 [async_llm.py:261] Added request cmpl-5e47cf4d740b4478bed6b3013dba6d27-0. INFO 03-01 18:45:54 [logger.py:42] Received request cmpl-cb309bb4c81d4f739352fadc1abfb056-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:54 [async_llm.py:261] Added request cmpl-cb309bb4c81d4f739352fadc1abfb056-0. INFO 03-01 18:45:55 [logger.py:42] Received request cmpl-d0d5e727f86e47d4bceda9b0034e23e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:55 [async_llm.py:261] Added request cmpl-d0d5e727f86e47d4bceda9b0034e23e2-0. INFO 03-01 18:45:56 [logger.py:42] Received request cmpl-5d045610d6de4db7ad315536f1271678-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:56 [async_llm.py:261] Added request cmpl-5d045610d6de4db7ad315536f1271678-0. INFO 03-01 18:45:57 [logger.py:42] Received request cmpl-e7317aa213824b6c81400848bdefae1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:57 [async_llm.py:261] Added request cmpl-e7317aa213824b6c81400848bdefae1a-0. INFO 03-01 18:45:58 [logger.py:42] Received request cmpl-f3350af5e32c44c19f43930ce8df44b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:45:58 [async_llm.py:261] Added request cmpl-f3350af5e32c44c19f43930ce8df44b5-0. INFO 03-01 18:46:00 [logger.py:42] Received request cmpl-d76781cafe9e4a50ac731373da80a0c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:00 [async_llm.py:261] Added request cmpl-d76781cafe9e4a50ac731373da80a0c0-0. INFO 03-01 18:46:01 [logger.py:42] Received request cmpl-7a392765d4264d8cbd7494842e0bf697-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:01 [async_llm.py:261] Added request cmpl-7a392765d4264d8cbd7494842e0bf697-0. INFO 03-01 18:46:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:46:02 [logger.py:42] Received request cmpl-dc400ed6409d4bc1852f6ec153d0b09d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:02 [async_llm.py:261] Added request cmpl-dc400ed6409d4bc1852f6ec153d0b09d-0. INFO 03-01 18:46:03 [logger.py:42] Received request cmpl-ef902c92e6bc4baf82da4d07c1732835-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:03 [async_llm.py:261] Added request cmpl-ef902c92e6bc4baf82da4d07c1732835-0. INFO 03-01 18:46:04 [logger.py:42] Received request cmpl-74b5eb1038814c1fa0b7e9f00e38e05b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:04 [async_llm.py:261] Added request cmpl-74b5eb1038814c1fa0b7e9f00e38e05b-0. INFO 03-01 18:46:05 [logger.py:42] Received request cmpl-62231adccbe74cab952ccaa408d6332a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:05 [async_llm.py:261] Added request cmpl-62231adccbe74cab952ccaa408d6332a-0. INFO 03-01 18:46:07 [logger.py:42] Received request cmpl-3f12213f029e4a0ab961e71f36d59e8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:07 [async_llm.py:261] Added request cmpl-3f12213f029e4a0ab961e71f36d59e8d-0. INFO 03-01 18:46:08 [logger.py:42] Received request cmpl-3f7b6e5d7702430b9a3ff32a8661a4c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:08 [async_llm.py:261] Added request cmpl-3f7b6e5d7702430b9a3ff32a8661a4c1-0. INFO 03-01 18:46:09 [logger.py:42] Received request cmpl-24163a576bdf4334894e8ab85ae9a6ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:09 [async_llm.py:261] Added request cmpl-24163a576bdf4334894e8ab85ae9a6ce-0. INFO 03-01 18:46:10 [logger.py:42] Received request cmpl-2a9aaa0abdfe42c99ceab65572dc6d56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:10 [async_llm.py:261] Added request cmpl-2a9aaa0abdfe42c99ceab65572dc6d56-0. INFO 03-01 18:46:11 [logger.py:42] Received request cmpl-f91944af1d084038b3a1bbd022bfc97a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:11 [async_llm.py:261] Added request cmpl-f91944af1d084038b3a1bbd022bfc97a-0. INFO 03-01 18:46:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:46:12 [logger.py:42] Received request cmpl-214b47a5410a4b549eb7aa61a6090fa6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:12 [async_llm.py:261] Added request cmpl-214b47a5410a4b549eb7aa61a6090fa6-0. INFO 03-01 18:46:14 [logger.py:42] Received request cmpl-c82d45d1904148d7bda14c9c594cd02d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:14 [async_llm.py:261] Added request cmpl-c82d45d1904148d7bda14c9c594cd02d-0. INFO 03-01 18:46:15 [logger.py:42] Received request cmpl-695657a845f24c2a93d9f1fff14963e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:15 [async_llm.py:261] Added request cmpl-695657a845f24c2a93d9f1fff14963e0-0. INFO 03-01 18:46:16 [logger.py:42] Received request cmpl-d45f199c686a4009a491d68f40580acd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:16 [async_llm.py:261] Added request cmpl-d45f199c686a4009a491d68f40580acd-0. INFO 03-01 18:46:17 [logger.py:42] Received request cmpl-d0066808cfaf414ab137d3c286975c2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:17 [async_llm.py:261] Added request cmpl-d0066808cfaf414ab137d3c286975c2b-0. INFO 03-01 18:46:18 [logger.py:42] Received request cmpl-afb93c94ab0b4a4b90ea3cdb9d919e5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:18 [async_llm.py:261] Added request cmpl-afb93c94ab0b4a4b90ea3cdb9d919e5c-0. INFO 03-01 18:46:19 [logger.py:42] Received request cmpl-b3db83f6c3f94a11bfc035b8360f8846-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:19 [async_llm.py:261] Added request cmpl-b3db83f6c3f94a11bfc035b8360f8846-0. INFO 03-01 18:46:21 [logger.py:42] Received request cmpl-c071857243654da08dc5de05ff73ac0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:21 [async_llm.py:261] Added request cmpl-c071857243654da08dc5de05ff73ac0f-0. INFO 03-01 18:46:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:46:22 [logger.py:42] Received request cmpl-958bc927889342c3b38e1d9f8011104a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:22 [async_llm.py:261] Added request cmpl-958bc927889342c3b38e1d9f8011104a-0. INFO 03-01 18:46:23 [logger.py:42] Received request cmpl-252db1583c4e4931a1a237ec82a85374-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:23 [async_llm.py:261] Added request cmpl-252db1583c4e4931a1a237ec82a85374-0. INFO 03-01 18:46:24 [logger.py:42] Received request cmpl-ea46d518860b4a15b2d7ab828c193e67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:24 [async_llm.py:261] Added request cmpl-ea46d518860b4a15b2d7ab828c193e67-0. INFO 03-01 18:46:25 [logger.py:42] Received request cmpl-2dd0c4a0ddd64b9bb50867236ed83d7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:25 [async_llm.py:261] Added request cmpl-2dd0c4a0ddd64b9bb50867236ed83d7b-0. INFO 03-01 18:46:26 [logger.py:42] Received request cmpl-877babcf61ad45d080dbbb8418796f15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:26 [async_llm.py:261] Added request cmpl-877babcf61ad45d080dbbb8418796f15-0. INFO 03-01 18:46:27 [logger.py:42] Received request cmpl-7b47ac73a7624edc866d2f8e5b98a46f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:27 [async_llm.py:261] Added request cmpl-7b47ac73a7624edc866d2f8e5b98a46f-0. INFO 03-01 18:46:29 [logger.py:42] Received request cmpl-687c6a78125d4d8f8e2951925ce52656-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:29 [async_llm.py:261] Added request cmpl-687c6a78125d4d8f8e2951925ce52656-0. INFO 03-01 18:46:30 [logger.py:42] Received request cmpl-dd36999a8f8f49d599993190a7c9fc00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:30 [async_llm.py:261] Added request cmpl-dd36999a8f8f49d599993190a7c9fc00-0. INFO 03-01 18:46:31 [logger.py:42] Received request cmpl-c92c15f84022407b89a6a21f8c725b62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:31 [async_llm.py:261] Added request cmpl-c92c15f84022407b89a6a21f8c725b62-0. INFO 03-01 18:46:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:46:32 [logger.py:42] Received request cmpl-edf0f5b3583c48bfaa42bba7742f759b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:32 [async_llm.py:261] Added request cmpl-edf0f5b3583c48bfaa42bba7742f759b-0. INFO 03-01 18:46:33 [logger.py:42] Received request cmpl-1356bdf5a9a442f9bc174dc07f7dffae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:33 [async_llm.py:261] Added request cmpl-1356bdf5a9a442f9bc174dc07f7dffae-0. INFO 03-01 18:46:34 [logger.py:42] Received request cmpl-63b5154124d346118bf74923bdbea19e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:34 [async_llm.py:261] Added request cmpl-63b5154124d346118bf74923bdbea19e-0. INFO 03-01 18:46:36 [logger.py:42] Received request cmpl-eda9961c7a8047c691200e6df52a5a72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:36 [async_llm.py:261] Added request cmpl-eda9961c7a8047c691200e6df52a5a72-0. INFO 03-01 18:46:37 [logger.py:42] Received request cmpl-67bbe1957f30495184785fef9ed8abe3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:37 [async_llm.py:261] Added request cmpl-67bbe1957f30495184785fef9ed8abe3-0. INFO 03-01 18:46:38 [logger.py:42] Received request cmpl-ad4259136516470980264a3a1ca54c5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:38 [async_llm.py:261] Added request cmpl-ad4259136516470980264a3a1ca54c5b-0. INFO 03-01 18:46:39 [logger.py:42] Received request cmpl-62e7ebd648924450a224324e5ee80be4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:39 [async_llm.py:261] Added request cmpl-62e7ebd648924450a224324e5ee80be4-0. INFO 03-01 18:46:40 [logger.py:42] Received request cmpl-3b0012886cc447b4b00c6e9dbefe1fbd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:40 [async_llm.py:261] Added request cmpl-3b0012886cc447b4b00c6e9dbefe1fbd-0. INFO 03-01 18:46:41 [logger.py:42] Received request cmpl-ec0dc65b8a2d45c6a8492e1fc9a1a091-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:41 [async_llm.py:261] Added request cmpl-ec0dc65b8a2d45c6a8492e1fc9a1a091-0. INFO 03-01 18:46:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.5% INFO 03-01 18:46:43 [logger.py:42] Received request cmpl-d8a3ac9ab84b45b78b544e41d66dfad9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:43 [async_llm.py:261] Added request cmpl-d8a3ac9ab84b45b78b544e41d66dfad9-0. INFO 03-01 18:46:44 [logger.py:42] Received request cmpl-cbf4bc0868214cf1881d8f94a7bbd80d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:44 [async_llm.py:261] Added request cmpl-cbf4bc0868214cf1881d8f94a7bbd80d-0. INFO 03-01 18:46:45 [logger.py:42] Received request cmpl-3d4870982c2e48c9aa2d2b7a11db0d18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:45 [async_llm.py:261] Added request cmpl-3d4870982c2e48c9aa2d2b7a11db0d18-0. INFO 03-01 18:46:46 [logger.py:42] Received request cmpl-bc60f11f4e4142d197be45cb5d7a6e7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:46 [async_llm.py:261] Added request cmpl-bc60f11f4e4142d197be45cb5d7a6e7f-0. INFO 03-01 18:46:47 [logger.py:42] Received request cmpl-783e3a9db0614b1c8239e543276d519b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:47 [async_llm.py:261] Added request cmpl-783e3a9db0614b1c8239e543276d519b-0. INFO 03-01 18:46:48 [logger.py:42] Received request cmpl-989926e72614463b8753797b70bf4ab3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:48 [async_llm.py:261] Added request cmpl-989926e72614463b8753797b70bf4ab3-0. INFO 03-01 18:46:50 [logger.py:42] Received request cmpl-0ffbf997a84f4663a56ba672f1d5fdf1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:50 [async_llm.py:261] Added request cmpl-0ffbf997a84f4663a56ba672f1d5fdf1-0. INFO 03-01 18:46:51 [logger.py:42] Received request cmpl-ffe5dcf410824162af1af350be8178b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:51 [async_llm.py:261] Added request cmpl-ffe5dcf410824162af1af350be8178b5-0. INFO 03-01 18:46:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:46:52 [logger.py:42] Received request cmpl-5eabcdf435c84febabf5ea15db2c540d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:52 [async_llm.py:261] Added request cmpl-5eabcdf435c84febabf5ea15db2c540d-0. INFO 03-01 18:46:53 [logger.py:42] Received request cmpl-cbe1dbe638c94c7daa8f8089fe153d68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:53 [async_llm.py:261] Added request cmpl-cbe1dbe638c94c7daa8f8089fe153d68-0. INFO 03-01 18:46:54 [logger.py:42] Received request cmpl-7f19dc0eb0344d7caa17c0f19e28db5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:54 [async_llm.py:261] Added request cmpl-7f19dc0eb0344d7caa17c0f19e28db5d-0. INFO 03-01 18:46:55 [logger.py:42] Received request cmpl-4226e3ea0e404c0681d29b6f2224a83b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:55 [async_llm.py:261] Added request cmpl-4226e3ea0e404c0681d29b6f2224a83b-0. INFO 03-01 18:46:57 [logger.py:42] Received request cmpl-107341b4f11d4ba2afa6edcb84342520-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:57 [async_llm.py:261] Added request cmpl-107341b4f11d4ba2afa6edcb84342520-0. INFO 03-01 18:46:58 [logger.py:42] Received request cmpl-91543351f79c45f4a5d14a458201b13d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:58 [async_llm.py:261] Added request cmpl-91543351f79c45f4a5d14a458201b13d-0. INFO 03-01 18:46:59 [logger.py:42] Received request cmpl-2ac49d19a63444d0bbc8d9cf7c44d7bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:46:59 [async_llm.py:261] Added request cmpl-2ac49d19a63444d0bbc8d9cf7c44d7bd-0. INFO 03-01 18:47:00 [logger.py:42] Received request cmpl-95f168dd52b94adcae61931a4a28a9f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:00 [async_llm.py:261] Added request cmpl-95f168dd52b94adcae61931a4a28a9f4-0. INFO 03-01 18:47:01 [logger.py:42] Received request cmpl-23081158f4d34ef8ac253d273751119a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:01 [async_llm.py:261] Added request cmpl-23081158f4d34ef8ac253d273751119a-0. INFO 03-01 18:47:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:47:02 [logger.py:42] Received request cmpl-3c4ba83828a344a8ba73da77fa45310d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:02 [async_llm.py:261] Added request cmpl-3c4ba83828a344a8ba73da77fa45310d-0. INFO 03-01 18:47:04 [logger.py:42] Received request cmpl-c4b924fa571f434a930e9116b0a65a51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:04 [async_llm.py:261] Added request cmpl-c4b924fa571f434a930e9116b0a65a51-0. INFO 03-01 18:47:05 [logger.py:42] Received request cmpl-d17d822429cd414dbfef17291b175932-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:05 [async_llm.py:261] Added request cmpl-d17d822429cd414dbfef17291b175932-0. INFO 03-01 18:47:06 [logger.py:42] Received request cmpl-6de0b82efffa4aa8a970a0145d31374b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:06 [async_llm.py:261] Added request cmpl-6de0b82efffa4aa8a970a0145d31374b-0. INFO 03-01 18:47:07 [logger.py:42] Received request cmpl-83ad197b7f784d8a9290300d5c5513da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:07 [async_llm.py:261] Added request cmpl-83ad197b7f784d8a9290300d5c5513da-0. INFO 03-01 18:47:08 [logger.py:42] Received request cmpl-e1aee66490914cd98f776d46afe4d27c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:08 [async_llm.py:261] Added request cmpl-e1aee66490914cd98f776d46afe4d27c-0. INFO 03-01 18:47:09 [logger.py:42] Received request cmpl-2f0853bcb1dc492189a299355e47726a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:09 [async_llm.py:261] Added request cmpl-2f0853bcb1dc492189a299355e47726a-0. INFO 03-01 18:47:11 [logger.py:42] Received request cmpl-d43d676222eb4a0f951eb49c15898369-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:11 [async_llm.py:261] Added request cmpl-d43d676222eb4a0f951eb49c15898369-0. INFO 03-01 18:47:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:47:12 [logger.py:42] Received request cmpl-2fba42453c194fad906a305c84d88411-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:12 [async_llm.py:261] Added request cmpl-2fba42453c194fad906a305c84d88411-0. INFO 03-01 18:47:13 [logger.py:42] Received request cmpl-77e86bb803794cb1a5b7e13e93a569f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:13 [async_llm.py:261] Added request cmpl-77e86bb803794cb1a5b7e13e93a569f5-0. INFO 03-01 18:47:14 [logger.py:42] Received request cmpl-79cf3bc9fdbe47a3b6d41dbd890b309f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:14 [async_llm.py:261] Added request cmpl-79cf3bc9fdbe47a3b6d41dbd890b309f-0. INFO 03-01 18:47:15 [logger.py:42] Received request cmpl-9161288cabce41bc932dc5870bb09a4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:15 [async_llm.py:261] Added request cmpl-9161288cabce41bc932dc5870bb09a4d-0. INFO 03-01 18:47:16 [logger.py:42] Received request cmpl-61ffc4ba8c534a5680019af8122a28a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:16 [async_llm.py:261] Added request cmpl-61ffc4ba8c534a5680019af8122a28a2-0. INFO 03-01 18:47:17 [logger.py:42] Received request cmpl-364ca56361cf44058f236ad5268a3239-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:17 [async_llm.py:261] Added request cmpl-364ca56361cf44058f236ad5268a3239-0. INFO 03-01 18:47:19 [logger.py:42] Received request cmpl-fe257f69f8b4453b9fed0b4fcd52044c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:19 [async_llm.py:261] Added request cmpl-fe257f69f8b4453b9fed0b4fcd52044c-0. INFO 03-01 18:47:20 [logger.py:42] Received request cmpl-89e355341ee0490bbbad0c914e58d518-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:20 [async_llm.py:261] Added request cmpl-89e355341ee0490bbbad0c914e58d518-0. INFO 03-01 18:47:21 [logger.py:42] Received request cmpl-5224b86c71aa4c98a0206f27ef4eb527-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:21 [async_llm.py:261] Added request cmpl-5224b86c71aa4c98a0206f27ef4eb527-0. INFO 03-01 18:47:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:47:22 [logger.py:42] Received request cmpl-3fb8a6ed97864cdc9fe889a2f54ffbb0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:22 [async_llm.py:261] Added request cmpl-3fb8a6ed97864cdc9fe889a2f54ffbb0-0. INFO 03-01 18:47:23 [logger.py:42] Received request cmpl-a6ffdd80a2a5436999e90f934ab65d6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:23 [async_llm.py:261] Added request cmpl-a6ffdd80a2a5436999e90f934ab65d6b-0. INFO 03-01 18:47:24 [logger.py:42] Received request cmpl-764c9a1f9ac3449997513dc6c0015e49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:24 [async_llm.py:261] Added request cmpl-764c9a1f9ac3449997513dc6c0015e49-0. INFO 03-01 18:47:26 [logger.py:42] Received request cmpl-633bd5ff01ec4dfa91ee8d1ebdab869a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:26 [async_llm.py:261] Added request cmpl-633bd5ff01ec4dfa91ee8d1ebdab869a-0. INFO 03-01 18:47:27 [logger.py:42] Received request cmpl-9eb23c6d347948edb86807a16ac8f168-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:27 [async_llm.py:261] Added request cmpl-9eb23c6d347948edb86807a16ac8f168-0. INFO 03-01 18:47:28 [logger.py:42] Received request cmpl-e194e85869ce488d8477d686195927d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:28 [async_llm.py:261] Added request cmpl-e194e85869ce488d8477d686195927d0-0. INFO 03-01 18:47:29 [logger.py:42] Received request cmpl-7b01ca509adf4413accf50d293862cba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:29 [async_llm.py:261] Added request cmpl-7b01ca509adf4413accf50d293862cba-0. INFO 03-01 18:47:30 [logger.py:42] Received request cmpl-5c1dbc15e5174ca991ae6f92ba2d9005-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:30 [async_llm.py:261] Added request cmpl-5c1dbc15e5174ca991ae6f92ba2d9005-0. INFO 03-01 18:47:31 [logger.py:42] Received request cmpl-c0a24bd3b07d45259efb2b3a4bd41d46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:31 [async_llm.py:261] Added request cmpl-c0a24bd3b07d45259efb2b3a4bd41d46-0. INFO 03-01 18:47:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.5% INFO 03-01 18:47:33 [logger.py:42] Received request cmpl-859d06eccad44678a184472307789dc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:33 [async_llm.py:261] Added request cmpl-859d06eccad44678a184472307789dc0-0. INFO 03-01 18:47:34 [logger.py:42] Received request cmpl-dadc29d7dbb146f08015ed3c30f882f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:34 [async_llm.py:261] Added request cmpl-dadc29d7dbb146f08015ed3c30f882f5-0. INFO 03-01 18:47:35 [logger.py:42] Received request cmpl-6406fec89f524c57840d33d451440a25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:35 [async_llm.py:261] Added request cmpl-6406fec89f524c57840d33d451440a25-0. INFO 03-01 18:47:36 [logger.py:42] Received request cmpl-920a9c9050ad4a06a5cadb92974ca16f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:36 [async_llm.py:261] Added request cmpl-920a9c9050ad4a06a5cadb92974ca16f-0. INFO 03-01 18:47:37 [logger.py:42] Received request cmpl-75b8f77309104f8d91c2b5b08bbc04a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:37 [async_llm.py:261] Added request cmpl-75b8f77309104f8d91c2b5b08bbc04a5-0. INFO 03-01 18:47:38 [logger.py:42] Received request cmpl-bfea5d7531514c9384081e2b79d07b50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:38 [async_llm.py:261] Added request cmpl-bfea5d7531514c9384081e2b79d07b50-0. INFO 03-01 18:47:39 [logger.py:42] Received request cmpl-0fb44e6af4d84f4e9189ca6d7bd0944d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:39 [async_llm.py:261] Added request cmpl-0fb44e6af4d84f4e9189ca6d7bd0944d-0. INFO 03-01 18:47:41 [logger.py:42] Received request cmpl-600a920b173e4f43bd39296e860b463a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:41 [async_llm.py:261] Added request cmpl-600a920b173e4f43bd39296e860b463a-0. INFO 03-01 18:47:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:47:42 [logger.py:42] Received request cmpl-f91ff184f8f741c88cb0550344eb64c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:42 [async_llm.py:261] Added request cmpl-f91ff184f8f741c88cb0550344eb64c9-0. INFO 03-01 18:47:43 [logger.py:42] Received request cmpl-2fca6817bf9146159f3173fd4c03f67f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:43 [async_llm.py:261] Added request cmpl-2fca6817bf9146159f3173fd4c03f67f-0. INFO 03-01 18:47:44 [logger.py:42] Received request cmpl-815aaeb3764447709b5998f711c5bdb2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:44 [async_llm.py:261] Added request cmpl-815aaeb3764447709b5998f711c5bdb2-0. INFO 03-01 18:47:45 [logger.py:42] Received request cmpl-d08bfcdb227c4440ba62326518a95b4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:45 [async_llm.py:261] Added request cmpl-d08bfcdb227c4440ba62326518a95b4e-0. INFO 03-01 18:47:46 [logger.py:42] Received request cmpl-0f03a658a55f49e896412804208a72d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:46 [async_llm.py:261] Added request cmpl-0f03a658a55f49e896412804208a72d9-0. INFO 03-01 18:47:48 [logger.py:42] Received request cmpl-57312eb130394b77a9c09c540d0d85fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:48 [async_llm.py:261] Added request cmpl-57312eb130394b77a9c09c540d0d85fa-0. INFO 03-01 18:47:49 [logger.py:42] Received request cmpl-cbcc29006ae442bf992253d1b3c6d887-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:49 [async_llm.py:261] Added request cmpl-cbcc29006ae442bf992253d1b3c6d887-0. INFO 03-01 18:47:50 [logger.py:42] Received request cmpl-f0476bca0844492d94c3c33e195389b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:50 [async_llm.py:261] Added request cmpl-f0476bca0844492d94c3c33e195389b1-0. INFO 03-01 18:47:51 [logger.py:42] Received request cmpl-9ee4a35a8d0c453da8e505f83d4d9a79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:51 [async_llm.py:261] Added request cmpl-9ee4a35a8d0c453da8e505f83d4d9a79-0. INFO 03-01 18:47:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:47:52 [logger.py:42] Received request cmpl-8cf920774e774f2e8f884a0af4bf32ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:52 [async_llm.py:261] Added request cmpl-8cf920774e774f2e8f884a0af4bf32ca-0. INFO 03-01 18:47:53 [logger.py:42] Received request cmpl-69fafa97f331457d8318490f1e564153-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:53 [async_llm.py:261] Added request cmpl-69fafa97f331457d8318490f1e564153-0. INFO 03-01 18:47:55 [logger.py:42] Received request cmpl-7d447a24c6e445cb8d1c5b06766a5dcb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:55 [async_llm.py:261] Added request cmpl-7d447a24c6e445cb8d1c5b06766a5dcb-0. INFO 03-01 18:47:56 [logger.py:42] Received request cmpl-4ce310adb7b24085a42da0ba989c48d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:56 [async_llm.py:261] Added request cmpl-4ce310adb7b24085a42da0ba989c48d2-0. INFO 03-01 18:47:57 [logger.py:42] Received request cmpl-cea698fc28e64c889167f4c05d258cf7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:57 [async_llm.py:261] Added request cmpl-cea698fc28e64c889167f4c05d258cf7-0. INFO 03-01 18:47:58 [logger.py:42] Received request cmpl-b0ac074ba37b46d794b85df82a8732c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:58 [async_llm.py:261] Added request cmpl-b0ac074ba37b46d794b85df82a8732c8-0. INFO 03-01 18:47:59 [logger.py:42] Received request cmpl-3abfa93c2d21408999d45324084b2cfc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:47:59 [async_llm.py:261] Added request cmpl-3abfa93c2d21408999d45324084b2cfc-0. INFO 03-01 18:48:00 [logger.py:42] Received request cmpl-0b8469d546964054aaf74a737fd5541d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:00 [async_llm.py:261] Added request cmpl-0b8469d546964054aaf74a737fd5541d-0. INFO 03-01 18:48:01 [logger.py:42] Received request cmpl-6e5a7eec5fae4deba3e4e88c98b27bc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:01 [async_llm.py:261] Added request cmpl-6e5a7eec5fae4deba3e4e88c98b27bc4-0. INFO 03-01 18:48:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6% INFO 03-01 18:48:03 [logger.py:42] Received request cmpl-172637335c484529b33da161e0113a9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:03 [async_llm.py:261] Added request cmpl-172637335c484529b33da161e0113a9d-0. INFO 03-01 18:48:04 [logger.py:42] Received request cmpl-16247edc276b449699a20f611da3186d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:04 [async_llm.py:261] Added request cmpl-16247edc276b449699a20f611da3186d-0. INFO 03-01 18:48:05 [logger.py:42] Received request cmpl-fa46b373b5ce4c78b2b8c2ae29635521-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:05 [async_llm.py:261] Added request cmpl-fa46b373b5ce4c78b2b8c2ae29635521-0. INFO 03-01 18:48:06 [logger.py:42] Received request cmpl-567b926ea66d4d8c8d6f47537de65aec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:06 [async_llm.py:261] Added request cmpl-567b926ea66d4d8c8d6f47537de65aec-0. INFO 03-01 18:48:07 [logger.py:42] Received request cmpl-65e17141f38344a6ae700d0ad4474713-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:07 [async_llm.py:261] Added request cmpl-65e17141f38344a6ae700d0ad4474713-0. INFO 03-01 18:48:08 [logger.py:42] Received request cmpl-8265065fb87942aea184d47014281eca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:08 [async_llm.py:261] Added request cmpl-8265065fb87942aea184d47014281eca-0. INFO 03-01 18:48:10 [logger.py:42] Received request cmpl-c8df71a5f01449019e0ad141410ba107-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:10 [async_llm.py:261] Added request cmpl-c8df71a5f01449019e0ad141410ba107-0. INFO 03-01 18:48:11 [logger.py:42] Received request cmpl-06fba805deb4472c8c27d8b3aadf6ecc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:11 [async_llm.py:261] Added request cmpl-06fba805deb4472c8c27d8b3aadf6ecc-0. INFO 03-01 18:48:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:48:12 [logger.py:42] Received request cmpl-d1f6efd397364e7eb905f4a5633588c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:12 [async_llm.py:261] Added request cmpl-d1f6efd397364e7eb905f4a5633588c2-0. INFO 03-01 18:48:13 [logger.py:42] Received request cmpl-d9d32f05d1e449bfbfce0fffd6874bac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:13 [async_llm.py:261] Added request cmpl-d9d32f05d1e449bfbfce0fffd6874bac-0. INFO 03-01 18:48:14 [logger.py:42] Received request cmpl-7d19c2c749b04841bf007c420b5f063f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:14 [async_llm.py:261] Added request cmpl-7d19c2c749b04841bf007c420b5f063f-0. INFO 03-01 18:48:15 [logger.py:42] Received request cmpl-2948cde38c80435abe4f0d12e5f29a1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:15 [async_llm.py:261] Added request cmpl-2948cde38c80435abe4f0d12e5f29a1e-0. INFO 03-01 18:48:17 [logger.py:42] Received request cmpl-001e58a138dc42508fcf2605be3bcdc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:17 [async_llm.py:261] Added request cmpl-001e58a138dc42508fcf2605be3bcdc4-0. INFO 03-01 18:48:18 [logger.py:42] Received request cmpl-8abdf08378fc4438b6883b46f2ea9094-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:18 [async_llm.py:261] Added request cmpl-8abdf08378fc4438b6883b46f2ea9094-0. INFO 03-01 18:48:19 [logger.py:42] Received request cmpl-9f07f19a7e4548eabe504b0029a563ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:19 [async_llm.py:261] Added request cmpl-9f07f19a7e4548eabe504b0029a563ec-0. INFO 03-01 18:48:20 [logger.py:42] Received request cmpl-4c411536b49042328271d98ac9ba6284-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:20 [async_llm.py:261] Added request cmpl-4c411536b49042328271d98ac9ba6284-0. INFO 03-01 18:48:21 [logger.py:42] Received request cmpl-474332959d2a46258a7376072cf34e17-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:21 [async_llm.py:261] Added request cmpl-474332959d2a46258a7376072cf34e17-0. INFO 03-01 18:48:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:48:22 [logger.py:42] Received request cmpl-076cf8fc33bb43c099f1b60e9a5bba2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:22 [async_llm.py:261] Added request cmpl-076cf8fc33bb43c099f1b60e9a5bba2c-0. INFO 03-01 18:48:23 [logger.py:42] Received request cmpl-0531a8085b964e8ea164443ff5eab27f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:23 [async_llm.py:261] Added request cmpl-0531a8085b964e8ea164443ff5eab27f-0. INFO 03-01 18:48:25 [logger.py:42] Received request cmpl-915a3d0605f241a98abc751ce9713d4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:25 [async_llm.py:261] Added request cmpl-915a3d0605f241a98abc751ce9713d4f-0. INFO 03-01 18:48:26 [logger.py:42] Received request cmpl-9029272b59c84262884d92f8fd7330c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:26 [async_llm.py:261] Added request cmpl-9029272b59c84262884d92f8fd7330c3-0. INFO 03-01 18:48:27 [logger.py:42] Received request cmpl-9f611333c55b4c07bf73434bf260136f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:27 [async_llm.py:261] Added request cmpl-9f611333c55b4c07bf73434bf260136f-0. INFO 03-01 18:48:28 [logger.py:42] Received request cmpl-e1e415f200654f0c93ca6ba6771a1ed4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:28 [async_llm.py:261] Added request cmpl-e1e415f200654f0c93ca6ba6771a1ed4-0. INFO 03-01 18:48:29 [logger.py:42] Received request cmpl-0aacd82fb5bb4eda8ea4362e253008cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:29 [async_llm.py:261] Added request cmpl-0aacd82fb5bb4eda8ea4362e253008cd-0. INFO 03-01 18:48:30 [logger.py:42] Received request cmpl-7577db422d1646eca102d10f528ee82c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:30 [async_llm.py:261] Added request cmpl-7577db422d1646eca102d10f528ee82c-0. INFO 03-01 18:48:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:48:32 [logger.py:42] Received request cmpl-580a68fbdbda40f0819664b7e6932e49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:32 [async_llm.py:261] Added request cmpl-580a68fbdbda40f0819664b7e6932e49-0. INFO 03-01 18:48:33 [logger.py:42] Received request cmpl-c064336e235b449fb57cae059e8db712-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:33 [async_llm.py:261] Added request cmpl-c064336e235b449fb57cae059e8db712-0. INFO 03-01 18:48:34 [logger.py:42] Received request cmpl-9ecf7b3a57814109b56585fbbcaed531-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:34 [async_llm.py:261] Added request cmpl-9ecf7b3a57814109b56585fbbcaed531-0. INFO 03-01 18:48:35 [logger.py:42] Received request cmpl-cdda55fca01f486dbc6888dbd9de6091-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:35 [async_llm.py:261] Added request cmpl-cdda55fca01f486dbc6888dbd9de6091-0. INFO 03-01 18:48:36 [logger.py:42] Received request cmpl-44db18fbd9b74b3f9da87b2fad8f6679-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:36 [async_llm.py:261] Added request cmpl-44db18fbd9b74b3f9da87b2fad8f6679-0. INFO 03-01 18:48:37 [logger.py:42] Received request cmpl-eedcfdde7a4e456b97abb870252d494b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:37 [async_llm.py:261] Added request cmpl-eedcfdde7a4e456b97abb870252d494b-0. INFO 03-01 18:48:39 [logger.py:42] Received request cmpl-4478459169e64cd49072001381bd9e75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:39 [async_llm.py:261] Added request cmpl-4478459169e64cd49072001381bd9e75-0. INFO 03-01 18:48:40 [logger.py:42] Received request cmpl-51a387f400d54d2987eb9f3bba7ce378-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:40 [async_llm.py:261] Added request cmpl-51a387f400d54d2987eb9f3bba7ce378-0. INFO 03-01 18:48:41 [logger.py:42] Received request cmpl-6571915deef54aafa73d3b53f5f05bf9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:41 [async_llm.py:261] Added request cmpl-6571915deef54aafa73d3b53f5f05bf9-0. INFO 03-01 18:48:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:48:42 [logger.py:42] Received request cmpl-4ec5e0fd64634f8e9ba51fa7f131fd5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:42 [async_llm.py:261] Added request cmpl-4ec5e0fd64634f8e9ba51fa7f131fd5c-0. INFO 03-01 18:48:43 [logger.py:42] Received request cmpl-ed7d239eab8f4f7db304e9e7778bdeff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:43 [async_llm.py:261] Added request cmpl-ed7d239eab8f4f7db304e9e7778bdeff-0. INFO 03-01 18:48:44 [logger.py:42] Received request cmpl-5d5e9fd4c41e47aeae69b0c815df47e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:44 [async_llm.py:261] Added request cmpl-5d5e9fd4c41e47aeae69b0c815df47e7-0. INFO 03-01 18:48:45 [logger.py:42] Received request cmpl-24d6a36fa29243ff87fd4d7f5457c884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:45 [async_llm.py:261] Added request cmpl-24d6a36fa29243ff87fd4d7f5457c884-0. INFO 03-01 18:48:47 [logger.py:42] Received request cmpl-2a4cf0806f8d47e7af231fd32ca92dd6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:47 [async_llm.py:261] Added request cmpl-2a4cf0806f8d47e7af231fd32ca92dd6-0. INFO 03-01 18:48:48 [logger.py:42] Received request cmpl-e112af43e211456089d827f1b5e49ae8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:48 [async_llm.py:261] Added request cmpl-e112af43e211456089d827f1b5e49ae8-0. INFO 03-01 18:48:49 [logger.py:42] Received request cmpl-82d165f9316241c69a5b1c06e5cb0208-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:49 [async_llm.py:261] Added request cmpl-82d165f9316241c69a5b1c06e5cb0208-0. INFO 03-01 18:48:50 [logger.py:42] Received request cmpl-df19d1d7643443e29cdf02d721d207f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:50 [async_llm.py:261] Added request cmpl-df19d1d7643443e29cdf02d721d207f3-0. INFO 03-01 18:48:51 [logger.py:42] Received request cmpl-b36ccec9841e48b2ace1af3d0e20428d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:51 [async_llm.py:261] Added request cmpl-b36ccec9841e48b2ace1af3d0e20428d-0. INFO 03-01 18:48:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:48:52 [logger.py:42] Received request cmpl-afeb3c8d36cd4a978e33d407f45c3f36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:52 [async_llm.py:261] Added request cmpl-afeb3c8d36cd4a978e33d407f45c3f36-0. INFO 03-01 18:48:54 [logger.py:42] Received request cmpl-0c804fad01854b478dbcd9c2dddc99ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:54 [async_llm.py:261] Added request cmpl-0c804fad01854b478dbcd9c2dddc99ef-0. INFO 03-01 18:48:55 [logger.py:42] Received request cmpl-a0aefa6e4430418fa82c066d23e8edd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:55 [async_llm.py:261] Added request cmpl-a0aefa6e4430418fa82c066d23e8edd8-0. INFO 03-01 18:48:56 [logger.py:42] Received request cmpl-7c6a58fb80414198b5ab98d6ed2f112b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:56 [async_llm.py:261] Added request cmpl-7c6a58fb80414198b5ab98d6ed2f112b-0. INFO 03-01 18:48:57 [logger.py:42] Received request cmpl-9c76bd93f8a543f1a3e36bd391cedb5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:57 [async_llm.py:261] Added request cmpl-9c76bd93f8a543f1a3e36bd391cedb5b-0. INFO 03-01 18:48:58 [logger.py:42] Received request cmpl-f7e4293df9f8429687f71d7e4b457b9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:58 [async_llm.py:261] Added request cmpl-f7e4293df9f8429687f71d7e4b457b9e-0. INFO 03-01 18:48:59 [logger.py:42] Received request cmpl-d7c38f58137c45e6978a096ad4b3ec48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:48:59 [async_llm.py:261] Added request cmpl-d7c38f58137c45e6978a096ad4b3ec48-0. INFO 03-01 18:49:01 [logger.py:42] Received request cmpl-db844cc1b1704cda841671c8ba7cc960-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:01 [async_llm.py:261] Added request cmpl-db844cc1b1704cda841671c8ba7cc960-0. INFO 03-01 18:49:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:49:02 [logger.py:42] Received request cmpl-f0cc7547b05d4752a0f1a63f5f6eb258-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:02 [async_llm.py:261] Added request cmpl-f0cc7547b05d4752a0f1a63f5f6eb258-0. INFO 03-01 18:49:03 [logger.py:42] Received request cmpl-c0e7dd8e2d604c4986d9c6954aca6ba1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:03 [async_llm.py:261] Added request cmpl-c0e7dd8e2d604c4986d9c6954aca6ba1-0. INFO 03-01 18:49:04 [logger.py:42] Received request cmpl-66d7fb3795534ce4a974cca5b1552b40-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:04 [async_llm.py:261] Added request cmpl-66d7fb3795534ce4a974cca5b1552b40-0. INFO 03-01 18:49:05 [logger.py:42] Received request cmpl-f34893aef2b84b6d8aff028257badcd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:05 [async_llm.py:261] Added request cmpl-f34893aef2b84b6d8aff028257badcd7-0. INFO 03-01 18:49:06 [logger.py:42] Received request cmpl-3c2fa5bdac874abe8bec4650cf133a5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:06 [async_llm.py:261] Added request cmpl-3c2fa5bdac874abe8bec4650cf133a5e-0. INFO 03-01 18:49:07 [logger.py:42] Received request cmpl-8c2decc8e0584bd8ad35ca3d8dd76cf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:07 [async_llm.py:261] Added request cmpl-8c2decc8e0584bd8ad35ca3d8dd76cf8-0. INFO 03-01 18:49:09 [logger.py:42] Received request cmpl-48a29594b63340eaa58c19cc71d7ab60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:09 [async_llm.py:261] Added request cmpl-48a29594b63340eaa58c19cc71d7ab60-0. INFO 03-01 18:49:10 [logger.py:42] Received request cmpl-719f6c3283f9410e9d55b7855fa6d71e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:10 [async_llm.py:261] Added request cmpl-719f6c3283f9410e9d55b7855fa6d71e-0. INFO 03-01 18:49:11 [logger.py:42] Received request cmpl-a036ee71e0b444159da407f1ebe4b0c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:11 [async_llm.py:261] Added request cmpl-a036ee71e0b444159da407f1ebe4b0c9-0. INFO 03-01 18:49:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:49:12 [logger.py:42] Received request cmpl-e066697341a1435d8fa38da341f0fa76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:12 [async_llm.py:261] Added request cmpl-e066697341a1435d8fa38da341f0fa76-0. INFO 03-01 18:49:13 [logger.py:42] Received request cmpl-89101193b5ef4800821890d284c88403-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:13 [async_llm.py:261] Added request cmpl-89101193b5ef4800821890d284c88403-0. INFO 03-01 18:49:14 [logger.py:42] Received request cmpl-e1ab6b884a4140899cea15d75b5ecf64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:14 [async_llm.py:261] Added request cmpl-e1ab6b884a4140899cea15d75b5ecf64-0. INFO 03-01 18:49:16 [logger.py:42] Received request cmpl-51a21dc042494073bc30da1917830eed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:16 [async_llm.py:261] Added request cmpl-51a21dc042494073bc30da1917830eed-0. INFO 03-01 18:49:17 [logger.py:42] Received request cmpl-37ea0dff81854963b99ba617503e65af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:17 [async_llm.py:261] Added request cmpl-37ea0dff81854963b99ba617503e65af-0. INFO 03-01 18:49:18 [logger.py:42] Received request cmpl-d531d4ccf97d4cbd9e96ce51b8556fc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:18 [async_llm.py:261] Added request cmpl-d531d4ccf97d4cbd9e96ce51b8556fc7-0. INFO 03-01 18:49:19 [logger.py:42] Received request cmpl-56fee26d0f1d487caa74b48368f57700-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:19 [async_llm.py:261] Added request cmpl-56fee26d0f1d487caa74b48368f57700-0. INFO 03-01 18:49:20 [logger.py:42] Received request cmpl-37c63149d7744aeb958973a702d0de24-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:20 [async_llm.py:261] Added request cmpl-37c63149d7744aeb958973a702d0de24-0. INFO 03-01 18:49:21 [logger.py:42] Received request cmpl-6b731b9a432546ecaf9c4cdd6d8df23e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:21 [async_llm.py:261] Added request cmpl-6b731b9a432546ecaf9c4cdd6d8df23e-0. INFO 03-01 18:49:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:49:23 [logger.py:42] Received request cmpl-34eaa670a1e64f6ca321d0d48b502edc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:23 [async_llm.py:261] Added request cmpl-34eaa670a1e64f6ca321d0d48b502edc-0. INFO 03-01 18:49:24 [logger.py:42] Received request cmpl-ea3517bb975842059ea3b19757f73315-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:24 [async_llm.py:261] Added request cmpl-ea3517bb975842059ea3b19757f73315-0. INFO 03-01 18:49:25 [logger.py:42] Received request cmpl-c340d7b59ac5491d913c2bc613cbc87c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:25 [async_llm.py:261] Added request cmpl-c340d7b59ac5491d913c2bc613cbc87c-0. INFO 03-01 18:49:26 [logger.py:42] Received request cmpl-44bc918d781a48b48857948222a6fbc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:26 [async_llm.py:261] Added request cmpl-44bc918d781a48b48857948222a6fbc5-0. INFO 03-01 18:49:27 [logger.py:42] Received request cmpl-c56f713dca6d464885502bd36a867918-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:27 [async_llm.py:261] Added request cmpl-c56f713dca6d464885502bd36a867918-0. INFO 03-01 18:49:28 [logger.py:42] Received request cmpl-ddb186250b3946ae82a71e4edfbd638d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:28 [async_llm.py:261] Added request cmpl-ddb186250b3946ae82a71e4edfbd638d-0. INFO 03-01 18:49:30 [logger.py:42] Received request cmpl-9127eacd6e684fe1afef880949829d15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:30 [async_llm.py:261] Added request cmpl-9127eacd6e684fe1afef880949829d15-0. INFO 03-01 18:49:31 [logger.py:42] Received request cmpl-7453d8a886ff4a859131071561c05037-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:31 [async_llm.py:261] Added request cmpl-7453d8a886ff4a859131071561c05037-0. INFO 03-01 18:49:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:49:32 [logger.py:42] Received request cmpl-8888f4833c894d48901c881277a5aa20-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:32 [async_llm.py:261] Added request cmpl-8888f4833c894d48901c881277a5aa20-0. INFO 03-01 18:49:33 [logger.py:42] Received request cmpl-badc7f2750c54acab39722866142fb2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:33 [async_llm.py:261] Added request cmpl-badc7f2750c54acab39722866142fb2d-0. INFO 03-01 18:49:34 [logger.py:42] Received request cmpl-7e0cb4b4b34d4440846afa12f7ac3e3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:34 [async_llm.py:261] Added request cmpl-7e0cb4b4b34d4440846afa12f7ac3e3b-0. INFO 03-01 18:49:35 [logger.py:42] Received request cmpl-c985cf4302ab4512bd25344fa46f97f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:35 [async_llm.py:261] Added request cmpl-c985cf4302ab4512bd25344fa46f97f6-0. INFO 03-01 18:49:36 [logger.py:42] Received request cmpl-ec2b77c7794e4f3dab84764772ee5de6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:36 [async_llm.py:261] Added request cmpl-ec2b77c7794e4f3dab84764772ee5de6-0. INFO 03-01 18:49:38 [logger.py:42] Received request cmpl-b5338862bd9242e6b49ef6f8cc7bec1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:38 [async_llm.py:261] Added request cmpl-b5338862bd9242e6b49ef6f8cc7bec1f-0. INFO 03-01 18:49:39 [logger.py:42] Received request cmpl-205a00f1691f4182be8ce792b3df581f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:39 [async_llm.py:261] Added request cmpl-205a00f1691f4182be8ce792b3df581f-0. INFO 03-01 18:49:40 [logger.py:42] Received request cmpl-1c9ac5dfd0714c4cad74d5e4fff193c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:40 [async_llm.py:261] Added request cmpl-1c9ac5dfd0714c4cad74d5e4fff193c8-0. INFO 03-01 18:49:41 [logger.py:42] Received request cmpl-938176db57604b39aaf59b87aaf256cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:41 [async_llm.py:261] Added request cmpl-938176db57604b39aaf59b87aaf256cd-0. INFO 03-01 18:49:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:49:42 [logger.py:42] Received request cmpl-afac6270cb454d2e84b8156a106cc227-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:42 [async_llm.py:261] Added request cmpl-afac6270cb454d2e84b8156a106cc227-0. INFO 03-01 18:49:43 [logger.py:42] Received request cmpl-2314f25752434f14895325bc76f5a0e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:43 [async_llm.py:261] Added request cmpl-2314f25752434f14895325bc76f5a0e2-0. INFO 03-01 18:49:45 [logger.py:42] Received request cmpl-7c3c95857bb7466bbc6f36ca9bbc3762-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:45 [async_llm.py:261] Added request cmpl-7c3c95857bb7466bbc6f36ca9bbc3762-0. INFO 03-01 18:49:46 [logger.py:42] Received request cmpl-a255220372d14ea98d5a8cc2851fce0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:46 [async_llm.py:261] Added request cmpl-a255220372d14ea98d5a8cc2851fce0f-0. INFO 03-01 18:49:47 [logger.py:42] Received request cmpl-3e09a402a28f4ef6b2e1e3940ab85ee9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:47 [async_llm.py:261] Added request cmpl-3e09a402a28f4ef6b2e1e3940ab85ee9-0. INFO 03-01 18:49:48 [logger.py:42] Received request cmpl-cf4bc362fe5f4dd595472de51d5cc496-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:48 [async_llm.py:261] Added request cmpl-cf4bc362fe5f4dd595472de51d5cc496-0. INFO 03-01 18:49:49 [logger.py:42] Received request cmpl-005d89167a104f0fa7345cadcc8c62dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:49 [async_llm.py:261] Added request cmpl-005d89167a104f0fa7345cadcc8c62dd-0. INFO 03-01 18:49:50 [logger.py:42] Received request cmpl-58bdc25bb8674bd09f7838d0a764fc55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:50 [async_llm.py:261] Added request cmpl-58bdc25bb8674bd09f7838d0a764fc55-0. INFO 03-01 18:49:52 [logger.py:42] Received request cmpl-a7d1bc30490648a789099333b4a23d42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:52 [async_llm.py:261] Added request cmpl-a7d1bc30490648a789099333b4a23d42-0. INFO 03-01 18:49:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6% INFO 03-01 18:49:53 [logger.py:42] Received request cmpl-ba85ac275e964fd39440dc67f34a1ad3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:53 [async_llm.py:261] Added request cmpl-ba85ac275e964fd39440dc67f34a1ad3-0. INFO 03-01 18:49:54 [logger.py:42] Received request cmpl-1065b655759f4a9d8d96fd1a4fd70267-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:54 [async_llm.py:261] Added request cmpl-1065b655759f4a9d8d96fd1a4fd70267-0. INFO 03-01 18:49:55 [logger.py:42] Received request cmpl-b2e52d1c35de4ad892625ccc649155b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:55 [async_llm.py:261] Added request cmpl-b2e52d1c35de4ad892625ccc649155b5-0. INFO 03-01 18:49:56 [logger.py:42] Received request cmpl-40aaeb9a654a48ac8a90a000a927ad86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:56 [async_llm.py:261] Added request cmpl-40aaeb9a654a48ac8a90a000a927ad86-0. INFO 03-01 18:49:57 [logger.py:42] Received request cmpl-c9d3a048fd484ef88dbce1469f732707-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:57 [async_llm.py:261] Added request cmpl-c9d3a048fd484ef88dbce1469f732707-0. INFO 03-01 18:49:58 [logger.py:42] Received request cmpl-cf46e07519ce43e5a9950831f04c9b67-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:49:58 [async_llm.py:261] Added request cmpl-cf46e07519ce43e5a9950831f04c9b67-0. INFO 03-01 18:50:00 [logger.py:42] Received request cmpl-6e782709df7f4bfaa9ae08b611fafb64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:00 [async_llm.py:261] Added request cmpl-6e782709df7f4bfaa9ae08b611fafb64-0. INFO 03-01 18:50:01 [logger.py:42] Received request cmpl-33b7266feff849cdab25b9787aacec41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:01 [async_llm.py:261] Added request cmpl-33b7266feff849cdab25b9787aacec41-0. INFO 03-01 18:50:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:50:02 [logger.py:42] Received request cmpl-00b2cb4e6be64798af7f8a89f0d8ca30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:02 [async_llm.py:261] Added request cmpl-00b2cb4e6be64798af7f8a89f0d8ca30-0. INFO 03-01 18:50:03 [logger.py:42] Received request cmpl-30e93dfa5f2a4d6394fae668cac5279d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:03 [async_llm.py:261] Added request cmpl-30e93dfa5f2a4d6394fae668cac5279d-0. INFO 03-01 18:50:04 [logger.py:42] Received request cmpl-953513478b04499b80d6b5ea01d20bc1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:04 [async_llm.py:261] Added request cmpl-953513478b04499b80d6b5ea01d20bc1-0. INFO 03-01 18:50:05 [logger.py:42] Received request cmpl-0e8559363def450ebfde478c47792203-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:05 [async_llm.py:261] Added request cmpl-0e8559363def450ebfde478c47792203-0. INFO 03-01 18:50:07 [logger.py:42] Received request cmpl-285591378f204229ae0711b9835f31d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:07 [async_llm.py:261] Added request cmpl-285591378f204229ae0711b9835f31d3-0. INFO 03-01 18:50:08 [logger.py:42] Received request cmpl-97ae12a780b942f79cff0daa7032df99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:08 [async_llm.py:261] Added request cmpl-97ae12a780b942f79cff0daa7032df99-0. INFO 03-01 18:50:09 [logger.py:42] Received request cmpl-cfb7f8ea4eab43cf86a6e89f8ff7d577-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:09 [async_llm.py:261] Added request cmpl-cfb7f8ea4eab43cf86a6e89f8ff7d577-0. INFO 03-01 18:50:10 [logger.py:42] Received request cmpl-482c4eafa90f4529b9bf50662a31f4a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:10 [async_llm.py:261] Added request cmpl-482c4eafa90f4529b9bf50662a31f4a8-0. INFO 03-01 18:50:11 [logger.py:42] Received request cmpl-d9b774f7a6b8498e8ccc44d1161eaa98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:11 [async_llm.py:261] Added request cmpl-d9b774f7a6b8498e8ccc44d1161eaa98-0. INFO 03-01 18:50:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:50:12 [logger.py:42] Received request cmpl-edd1c0698a3840d5906083ab18a48799-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:12 [async_llm.py:261] Added request cmpl-edd1c0698a3840d5906083ab18a48799-0. INFO 03-01 18:50:14 [logger.py:42] Received request cmpl-b8bd657f10884829bc0bb6116af394f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:14 [async_llm.py:261] Added request cmpl-b8bd657f10884829bc0bb6116af394f1-0. INFO 03-01 18:50:15 [logger.py:42] Received request cmpl-31bc4610bd704a818829a12fa0a1f2b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:15 [async_llm.py:261] Added request cmpl-31bc4610bd704a818829a12fa0a1f2b3-0. INFO 03-01 18:50:16 [logger.py:42] Received request cmpl-b88f75fddfdd4fde82e99ec56bdf2fd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:16 [async_llm.py:261] Added request cmpl-b88f75fddfdd4fde82e99ec56bdf2fd0-0. INFO 03-01 18:50:17 [logger.py:42] Received request cmpl-2bcf0f3142b644ce9819398686ee23d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:17 [async_llm.py:261] Added request cmpl-2bcf0f3142b644ce9819398686ee23d1-0. INFO 03-01 18:50:18 [logger.py:42] Received request cmpl-2bec9fc5f0a1457eaff60bfe7c1cb91f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:18 [async_llm.py:261] Added request cmpl-2bec9fc5f0a1457eaff60bfe7c1cb91f-0. INFO 03-01 18:50:20 [logger.py:42] Received request cmpl-2ee0f40d8aea4690a827d59407e90ae4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:20 [async_llm.py:261] Added request cmpl-2ee0f40d8aea4690a827d59407e90ae4-0. INFO 03-01 18:50:21 [logger.py:42] Received request cmpl-440c98ae86fc4d0eb4f2e362626e39bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:21 [async_llm.py:261] Added request cmpl-440c98ae86fc4d0eb4f2e362626e39bf-0. INFO 03-01 18:50:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:50:22 [logger.py:42] Received request cmpl-3ea7a820aeb9460e801118920969a93b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:22 [async_llm.py:261] Added request cmpl-3ea7a820aeb9460e801118920969a93b-0. INFO 03-01 18:50:23 [logger.py:42] Received request cmpl-6245c6045dc44f4088027b7b7d72374e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:23 [async_llm.py:261] Added request cmpl-6245c6045dc44f4088027b7b7d72374e-0. INFO 03-01 18:50:24 [logger.py:42] Received request cmpl-c1683737cc6c4a75ac5c95341ad6c5fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:24 [async_llm.py:261] Added request cmpl-c1683737cc6c4a75ac5c95341ad6c5fb-0. INFO 03-01 18:50:25 [logger.py:42] Received request cmpl-df8ef9036de64fafa9fda76950c1a00d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:25 [async_llm.py:261] Added request cmpl-df8ef9036de64fafa9fda76950c1a00d-0. INFO 03-01 18:50:27 [logger.py:42] Received request cmpl-cb17f7af60cf4fdc837ef71df9e384c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:27 [async_llm.py:261] Added request cmpl-cb17f7af60cf4fdc837ef71df9e384c0-0. INFO 03-01 18:50:28 [logger.py:42] Received request cmpl-98049c164c6041c1ba1214735da1b8ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:28 [async_llm.py:261] Added request cmpl-98049c164c6041c1ba1214735da1b8ff-0. INFO 03-01 18:50:29 [logger.py:42] Received request cmpl-715fd63e15134d75bb418818fbceadcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:29 [async_llm.py:261] Added request cmpl-715fd63e15134d75bb418818fbceadcc-0. INFO 03-01 18:50:30 [logger.py:42] Received request cmpl-ea0e559cd0ca4b2cb0d5effc255d4f59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:30 [async_llm.py:261] Added request cmpl-ea0e559cd0ca4b2cb0d5effc255d4f59-0. INFO 03-01 18:50:31 [logger.py:42] Received request cmpl-5b100fd61c8c44e38f8b1c8fd67cbd5e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:31 [async_llm.py:261] Added request cmpl-5b100fd61c8c44e38f8b1c8fd67cbd5e-0. INFO 03-01 18:50:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:50:32 [logger.py:42] Received request cmpl-ef161d8d6f6e472e91ba40e6e1049795-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:32 [async_llm.py:261] Added request cmpl-ef161d8d6f6e472e91ba40e6e1049795-0. INFO 03-01 18:50:33 [logger.py:42] Received request cmpl-868729a6b8ca4eaabfe3afbffd56947f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:33 [async_llm.py:261] Added request cmpl-868729a6b8ca4eaabfe3afbffd56947f-0. INFO 03-01 18:50:35 [logger.py:42] Received request cmpl-d688512497be4fe0b238f9f7416597f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:35 [async_llm.py:261] Added request cmpl-d688512497be4fe0b238f9f7416597f6-0. INFO 03-01 18:50:36 [logger.py:42] Received request cmpl-98c766742ee74b14a97466e916fc67fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:36 [async_llm.py:261] Added request cmpl-98c766742ee74b14a97466e916fc67fd-0. INFO 03-01 18:50:37 [logger.py:42] Received request cmpl-e758c2fea12d4a369aa4a9a1963b46db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:37 [async_llm.py:261] Added request cmpl-e758c2fea12d4a369aa4a9a1963b46db-0. INFO 03-01 18:50:38 [logger.py:42] Received request cmpl-e841d65fb14e4a83a91d5ca570eb126f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:38 [async_llm.py:261] Added request cmpl-e841d65fb14e4a83a91d5ca570eb126f-0. INFO 03-01 18:50:39 [logger.py:42] Received request cmpl-0f1ef321fb70486aae7287294d97127e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:39 [async_llm.py:261] Added request cmpl-0f1ef321fb70486aae7287294d97127e-0. INFO 03-01 18:50:40 [logger.py:42] Received request cmpl-9e0cc6d00d7e44d4a6e0928186a15e75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:40 [async_llm.py:261] Added request cmpl-9e0cc6d00d7e44d4a6e0928186a15e75-0. INFO 03-01 18:50:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:50:42 [logger.py:42] Received request cmpl-edff7c2dc95f400d993f4ad9a1eb2208-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:42 [async_llm.py:261] Added request cmpl-edff7c2dc95f400d993f4ad9a1eb2208-0. INFO 03-01 18:50:43 [logger.py:42] Received request cmpl-ef7bb00e80914678b1df520985a70845-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:43 [async_llm.py:261] Added request cmpl-ef7bb00e80914678b1df520985a70845-0. INFO 03-01 18:50:44 [logger.py:42] Received request cmpl-cf42e228c45247b7b398525b14ef87f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:44 [async_llm.py:261] Added request cmpl-cf42e228c45247b7b398525b14ef87f8-0. INFO 03-01 18:50:45 [logger.py:42] Received request cmpl-dd8252d3370a43bb98b90a2f53e5e9ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:45 [async_llm.py:261] Added request cmpl-dd8252d3370a43bb98b90a2f53e5e9ac-0. INFO 03-01 18:50:46 [logger.py:42] Received request cmpl-da492f921849419aaaa5a379502d0d7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:46 [async_llm.py:261] Added request cmpl-da492f921849419aaaa5a379502d0d7d-0. INFO 03-01 18:50:47 [logger.py:42] Received request cmpl-82072423ceea4a12b9370dd064c374f2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:47 [async_llm.py:261] Added request cmpl-82072423ceea4a12b9370dd064c374f2-0. INFO 03-01 18:50:49 [logger.py:42] Received request cmpl-4c7df8b1d7394a44ba966570d4938379-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:49 [async_llm.py:261] Added request cmpl-4c7df8b1d7394a44ba966570d4938379-0. INFO 03-01 18:50:50 [logger.py:42] Received request cmpl-318efd7c45dc4a468548464c67d96820-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:50 [async_llm.py:261] Added request cmpl-318efd7c45dc4a468548464c67d96820-0. INFO 03-01 18:50:51 [logger.py:42] Received request cmpl-0b5de5371fa44727b19f08169fb009b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:51 [async_llm.py:261] Added request cmpl-0b5de5371fa44727b19f08169fb009b8-0. INFO 03-01 18:50:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:50:52 [logger.py:42] Received request cmpl-a94565e5a91f4c6d8f0a614659000823-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:52 [async_llm.py:261] Added request cmpl-a94565e5a91f4c6d8f0a614659000823-0. INFO 03-01 18:50:53 [logger.py:42] Received request cmpl-761f7e6d9e6444ccab421d270dfcc9cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:53 [async_llm.py:261] Added request cmpl-761f7e6d9e6444ccab421d270dfcc9cf-0. INFO 03-01 18:50:54 [logger.py:42] Received request cmpl-77321156b1b94565b7206d63a1bd97f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:54 [async_llm.py:261] Added request cmpl-77321156b1b94565b7206d63a1bd97f6-0. INFO 03-01 18:50:56 [logger.py:42] Received request cmpl-3c0bb3eeb1254f619d9b26fbf16f8b88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:56 [async_llm.py:261] Added request cmpl-3c0bb3eeb1254f619d9b26fbf16f8b88-0. INFO 03-01 18:50:57 [logger.py:42] Received request cmpl-875adbc52fb34569bdcd06740a6a8c0d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:57 [async_llm.py:261] Added request cmpl-875adbc52fb34569bdcd06740a6a8c0d-0. INFO 03-01 18:50:58 [logger.py:42] Received request cmpl-93f4eea5ea0f4e67b34c50362f0e6281-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:58 [async_llm.py:261] Added request cmpl-93f4eea5ea0f4e67b34c50362f0e6281-0. INFO 03-01 18:50:59 [logger.py:42] Received request cmpl-bfd3635f7b4a4216a7667fb4693e8b65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:50:59 [async_llm.py:261] Added request cmpl-bfd3635f7b4a4216a7667fb4693e8b65-0. INFO 03-01 18:51:00 [logger.py:42] Received request cmpl-b70afea8f3ac4f66a0863515adb16ccb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:00 [async_llm.py:261] Added request cmpl-b70afea8f3ac4f66a0863515adb16ccb-0. INFO 03-01 18:51:01 [logger.py:42] Received request cmpl-e39d0d8a21014421a4762b55b209e92d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:01 [async_llm.py:261] Added request cmpl-e39d0d8a21014421a4762b55b209e92d-0. INFO 03-01 18:51:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:51:03 [logger.py:42] Received request cmpl-5b1bbcd3c6ef4a01a623f43bc7a78000-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:03 [async_llm.py:261] Added request cmpl-5b1bbcd3c6ef4a01a623f43bc7a78000-0. INFO 03-01 18:51:04 [logger.py:42] Received request cmpl-23c685f5de334db483e8258a483f1328-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:04 [async_llm.py:261] Added request cmpl-23c685f5de334db483e8258a483f1328-0. INFO 03-01 18:51:05 [logger.py:42] Received request cmpl-52289b61682a43ab8f005aa57c6734e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:05 [async_llm.py:261] Added request cmpl-52289b61682a43ab8f005aa57c6734e8-0. INFO 03-01 18:51:06 [logger.py:42] Received request cmpl-88ef0ef0afdc4d8d912264db1f62268b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:06 [async_llm.py:261] Added request cmpl-88ef0ef0afdc4d8d912264db1f62268b-0. INFO 03-01 18:51:07 [logger.py:42] Received request cmpl-021a11a51a9943e0a36a3141c807ea21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:07 [async_llm.py:261] Added request cmpl-021a11a51a9943e0a36a3141c807ea21-0. INFO 03-01 18:51:09 [logger.py:42] Received request cmpl-12bd7ef8b63b4e3bb1b89ab59405acd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:09 [async_llm.py:261] Added request cmpl-12bd7ef8b63b4e3bb1b89ab59405acd3-0. INFO 03-01 18:51:10 [logger.py:42] Received request cmpl-bf439c674dcd4138957bda44ecc6c917-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:10 [async_llm.py:261] Added request cmpl-bf439c674dcd4138957bda44ecc6c917-0. INFO 03-01 18:51:11 [logger.py:42] Received request cmpl-f441b9f419d1456d8c3450e985c24e7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:11 [async_llm.py:261] Added request cmpl-f441b9f419d1456d8c3450e985c24e7e-0. INFO 03-01 18:51:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:51:12 [logger.py:42] Received request cmpl-c85fa9a3a461405f81812af58624ad9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:12 [async_llm.py:261] Added request cmpl-c85fa9a3a461405f81812af58624ad9d-0. INFO 03-01 18:51:13 [logger.py:42] Received request cmpl-c032682fd16741f3a1e99a75a5b7495f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:13 [async_llm.py:261] Added request cmpl-c032682fd16741f3a1e99a75a5b7495f-0. INFO 03-01 18:51:14 [logger.py:42] Received request cmpl-c29b0acb801547bdb972c57217231681-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:14 [async_llm.py:261] Added request cmpl-c29b0acb801547bdb972c57217231681-0. INFO 03-01 18:51:16 [logger.py:42] Received request cmpl-73ca772b11ed4bbb9092e106251fc1cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:16 [async_llm.py:261] Added request cmpl-73ca772b11ed4bbb9092e106251fc1cc-0. INFO 03-01 18:51:17 [logger.py:42] Received request cmpl-c0fc5c36c2a1451484e1fdfc0ad100e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:17 [async_llm.py:261] Added request cmpl-c0fc5c36c2a1451484e1fdfc0ad100e0-0. INFO 03-01 18:51:18 [logger.py:42] Received request cmpl-a973224a4ac8408ab677dbf9f04f74de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:18 [async_llm.py:261] Added request cmpl-a973224a4ac8408ab677dbf9f04f74de-0. INFO 03-01 18:51:19 [logger.py:42] Received request cmpl-c5939c94d1054b2e893a5c554fd005da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:19 [async_llm.py:261] Added request cmpl-c5939c94d1054b2e893a5c554fd005da-0. INFO 03-01 18:51:20 [logger.py:42] Received request cmpl-9a569129b72047529dd06877b29a2b4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:20 [async_llm.py:261] Added request cmpl-9a569129b72047529dd06877b29a2b4f-0. INFO 03-01 18:51:21 [logger.py:42] Received request cmpl-93f649253f2148f5bbeebc9e7fc1deba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:21 [async_llm.py:261] Added request cmpl-93f649253f2148f5bbeebc9e7fc1deba-0. INFO 03-01 18:51:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:51:23 [logger.py:42] Received request cmpl-fb74a2920efc40fa998306e7b5636735-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:23 [async_llm.py:261] Added request cmpl-fb74a2920efc40fa998306e7b5636735-0. INFO 03-01 18:51:24 [logger.py:42] Received request cmpl-9e44c667b4964b979c354864d4049dbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:24 [async_llm.py:261] Added request cmpl-9e44c667b4964b979c354864d4049dbf-0. INFO 03-01 18:51:25 [logger.py:42] Received request cmpl-8267c2b4489f423bbce7cb53388994b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:25 [async_llm.py:261] Added request cmpl-8267c2b4489f423bbce7cb53388994b2-0. INFO 03-01 18:51:26 [logger.py:42] Received request cmpl-ba417258f4be4f878740b1dc0ff2efee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:26 [async_llm.py:261] Added request cmpl-ba417258f4be4f878740b1dc0ff2efee-0. INFO 03-01 18:51:27 [logger.py:42] Received request cmpl-f3a64771c6534c889aef548858028bc8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:27 [async_llm.py:261] Added request cmpl-f3a64771c6534c889aef548858028bc8-0. INFO 03-01 18:51:28 [logger.py:42] Received request cmpl-143c1e69b70245f5ac377b587d3d1a42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:28 [async_llm.py:261] Added request cmpl-143c1e69b70245f5ac377b587d3d1a42-0. INFO 03-01 18:51:30 [logger.py:42] Received request cmpl-f738282a002442d9b6caf6ddead15b6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:30 [async_llm.py:261] Added request cmpl-f738282a002442d9b6caf6ddead15b6d-0. INFO 03-01 18:51:31 [logger.py:42] Received request cmpl-c90b80a7737d4c4289ca7533170a1ae4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:31 [async_llm.py:261] Added request cmpl-c90b80a7737d4c4289ca7533170a1ae4-0. INFO 03-01 18:51:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:51:32 [logger.py:42] Received request cmpl-428e2daa87ce4cf0ac02188a58579ded-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:32 [async_llm.py:261] Added request cmpl-428e2daa87ce4cf0ac02188a58579ded-0. INFO 03-01 18:51:33 [logger.py:42] Received request cmpl-fda6e4dfe2b04e41aeb961230a5be4e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:33 [async_llm.py:261] Added request cmpl-fda6e4dfe2b04e41aeb961230a5be4e4-0. INFO 03-01 18:51:34 [logger.py:42] Received request cmpl-68ee16b4de874f29a81573b1780ce397-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:34 [async_llm.py:261] Added request cmpl-68ee16b4de874f29a81573b1780ce397-0. INFO 03-01 18:51:35 [logger.py:42] Received request cmpl-d366d5e712254b5e810de17e14c80a93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:35 [async_llm.py:261] Added request cmpl-d366d5e712254b5e810de17e14c80a93-0. INFO 03-01 18:51:37 [logger.py:42] Received request cmpl-1deba25650f7487ca37bef732173c3a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:37 [async_llm.py:261] Added request cmpl-1deba25650f7487ca37bef732173c3a0-0. INFO 03-01 18:51:38 [logger.py:42] Received request cmpl-fb417abd97dd4e11b619a57dd05ce5d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:38 [async_llm.py:261] Added request cmpl-fb417abd97dd4e11b619a57dd05ce5d7-0. INFO 03-01 18:51:39 [logger.py:42] Received request cmpl-0b4bccf895774bdf9f3f0a5be1849e58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:39 [async_llm.py:261] Added request cmpl-0b4bccf895774bdf9f3f0a5be1849e58-0. INFO 03-01 18:51:40 [logger.py:42] Received request cmpl-b280f927929146c184d96252fa90b26d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:40 [async_llm.py:261] Added request cmpl-b280f927929146c184d96252fa90b26d-0. INFO 03-01 18:51:41 [logger.py:42] Received request cmpl-cc40928186b24600bcfcbb695e2b9a7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:41 [async_llm.py:261] Added request cmpl-cc40928186b24600bcfcbb695e2b9a7d-0. INFO 03-01 18:51:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:51:42 [logger.py:42] Received request cmpl-cdc1e8de625047e887f4ba0cf8d48950-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:42 [async_llm.py:261] Added request cmpl-cdc1e8de625047e887f4ba0cf8d48950-0. INFO 03-01 18:51:44 [logger.py:42] Received request cmpl-927223fcfeb644cdbb1e781adc749fc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:44 [async_llm.py:261] Added request cmpl-927223fcfeb644cdbb1e781adc749fc7-0. INFO 03-01 18:51:45 [logger.py:42] Received request cmpl-a618de1ae09a4c58a1b30297807c4cad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:45 [async_llm.py:261] Added request cmpl-a618de1ae09a4c58a1b30297807c4cad-0. INFO 03-01 18:51:46 [logger.py:42] Received request cmpl-867b2fb7a9754676aa30a51997fcfdaf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:46 [async_llm.py:261] Added request cmpl-867b2fb7a9754676aa30a51997fcfdaf-0. INFO 03-01 18:51:47 [logger.py:42] Received request cmpl-f0904ee13d154c7993e77d4e2e349c71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:47 [async_llm.py:261] Added request cmpl-f0904ee13d154c7993e77d4e2e349c71-0. INFO 03-01 18:51:48 [logger.py:42] Received request cmpl-b38b34fba3974be4bc0de9ecd965186d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:48 [async_llm.py:261] Added request cmpl-b38b34fba3974be4bc0de9ecd965186d-0. INFO 03-01 18:51:49 [logger.py:42] Received request cmpl-a6b3335087724eb78ba89913758c00b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:49 [async_llm.py:261] Added request cmpl-a6b3335087724eb78ba89913758c00b5-0. INFO 03-01 18:51:51 [logger.py:42] Received request cmpl-99a8851ba89e4ed8bd9e7bcbc4b6bd63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:51 [async_llm.py:261] Added request cmpl-99a8851ba89e4ed8bd9e7bcbc4b6bd63-0. INFO 03-01 18:51:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:51:52 [logger.py:42] Received request cmpl-2985af8d6c054f51aa9743858e542a71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:52 [async_llm.py:261] Added request cmpl-2985af8d6c054f51aa9743858e542a71-0. INFO 03-01 18:51:53 [logger.py:42] Received request cmpl-e56d5f8d1c994165a40f6ec980a51c93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:53 [async_llm.py:261] Added request cmpl-e56d5f8d1c994165a40f6ec980a51c93-0. INFO 03-01 18:51:54 [logger.py:42] Received request cmpl-dd5f99bd131a4b6399b56991e9a59c0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:54 [async_llm.py:261] Added request cmpl-dd5f99bd131a4b6399b56991e9a59c0a-0. INFO 03-01 18:51:55 [logger.py:42] Received request cmpl-fc8ca4687e4b4b8c94d12e992362e530-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:55 [async_llm.py:261] Added request cmpl-fc8ca4687e4b4b8c94d12e992362e530-0. INFO 03-01 18:51:56 [logger.py:42] Received request cmpl-4f2fd890405349ab828dc34f39ff5846-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:56 [async_llm.py:261] Added request cmpl-4f2fd890405349ab828dc34f39ff5846-0. INFO 03-01 18:51:58 [logger.py:42] Received request cmpl-c6570960b170409aa0ec74ee530fffa4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:58 [async_llm.py:261] Added request cmpl-c6570960b170409aa0ec74ee530fffa4-0. INFO 03-01 18:51:59 [logger.py:42] Received request cmpl-4199b3b9e7bd450883d6bcdda6a610c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:51:59 [async_llm.py:261] Added request cmpl-4199b3b9e7bd450883d6bcdda6a610c2-0. INFO 03-01 18:52:00 [logger.py:42] Received request cmpl-fcf40fa91b8d4690ae4a7f6e16392acb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:00 [async_llm.py:261] Added request cmpl-fcf40fa91b8d4690ae4a7f6e16392acb-0. INFO 03-01 18:52:01 [logger.py:42] Received request cmpl-768a41d5a24b472992f11d9536cc8d2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:01 [async_llm.py:261] Added request cmpl-768a41d5a24b472992f11d9536cc8d2d-0. INFO 03-01 18:52:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:52:02 [logger.py:42] Received request cmpl-8967addf85ef4d69822711cdbba0c80d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:02 [async_llm.py:261] Added request cmpl-8967addf85ef4d69822711cdbba0c80d-0. INFO 03-01 18:52:03 [logger.py:42] Received request cmpl-22a6b69e85394ecbade211926013f301-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:03 [async_llm.py:261] Added request cmpl-22a6b69e85394ecbade211926013f301-0. INFO 03-01 18:52:05 [logger.py:42] Received request cmpl-fb237dfa101d4de288980c4a4473bd13-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:05 [async_llm.py:261] Added request cmpl-fb237dfa101d4de288980c4a4473bd13-0. INFO 03-01 18:52:06 [logger.py:42] Received request cmpl-c93f8d928fd44dfea48e302ddd4a4402-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:06 [async_llm.py:261] Added request cmpl-c93f8d928fd44dfea48e302ddd4a4402-0. INFO 03-01 18:52:07 [logger.py:42] Received request cmpl-3db5bcfe598244f7ac124c38a943dea0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:07 [async_llm.py:261] Added request cmpl-3db5bcfe598244f7ac124c38a943dea0-0. INFO 03-01 18:52:08 [logger.py:42] Received request cmpl-9e942bfcc7e04c42b257222aa112db04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:08 [async_llm.py:261] Added request cmpl-9e942bfcc7e04c42b257222aa112db04-0. INFO 03-01 18:52:09 [logger.py:42] Received request cmpl-14068c9b3c864fc4954b0c1d956e216e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:09 [async_llm.py:261] Added request cmpl-14068c9b3c864fc4954b0c1d956e216e-0. INFO 03-01 18:52:10 [logger.py:42] Received request cmpl-bcf7efd37b704031a695c83460d42790-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:10 [async_llm.py:261] Added request cmpl-bcf7efd37b704031a695c83460d42790-0. INFO 03-01 18:52:12 [logger.py:42] Received request cmpl-77e18356837642388740f678469c09eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:12 [async_llm.py:261] Added request cmpl-77e18356837642388740f678469c09eb-0. INFO 03-01 18:52:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:52:13 [logger.py:42] Received request cmpl-d52ce24a03da470587822089645a097b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:13 [async_llm.py:261] Added request cmpl-d52ce24a03da470587822089645a097b-0. INFO 03-01 18:52:14 [logger.py:42] Received request cmpl-265a33f02bf84c6ab793fc31291fe45a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:14 [async_llm.py:261] Added request cmpl-265a33f02bf84c6ab793fc31291fe45a-0. INFO 03-01 18:52:15 [logger.py:42] Received request cmpl-3343754f6c4e4b3d896f3977f70dd730-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:15 [async_llm.py:261] Added request cmpl-3343754f6c4e4b3d896f3977f70dd730-0. INFO 03-01 18:52:16 [logger.py:42] Received request cmpl-840d28feb11447ac9498468f6a4ae223-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:16 [async_llm.py:261] Added request cmpl-840d28feb11447ac9498468f6a4ae223-0. INFO 03-01 18:52:17 [logger.py:42] Received request cmpl-feccea9cef5d4162969d2d12c61ba135-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:17 [async_llm.py:261] Added request cmpl-feccea9cef5d4162969d2d12c61ba135-0. INFO 03-01 18:52:19 [logger.py:42] Received request cmpl-aec5e19bc3654f5aa850e0aad907ae73-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:19 [async_llm.py:261] Added request cmpl-aec5e19bc3654f5aa850e0aad907ae73-0. INFO 03-01 18:52:20 [logger.py:42] Received request cmpl-60c00a56680b46e98e3be055533bfba7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:20 [async_llm.py:261] Added request cmpl-60c00a56680b46e98e3be055533bfba7-0. INFO 03-01 18:52:21 [logger.py:42] Received request cmpl-0703a2effb8b48bb9ace34fe840c0b3c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:21 [async_llm.py:261] Added request cmpl-0703a2effb8b48bb9ace34fe840c0b3c-0. INFO 03-01 18:52:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:52:22 [logger.py:42] Received request cmpl-57360f9b50da482ab3dd4945cac07894-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:22 [async_llm.py:261] Added request cmpl-57360f9b50da482ab3dd4945cac07894-0. INFO 03-01 18:52:23 [logger.py:42] Received request cmpl-e3b4788dacd049c69ec9951e0aec6a2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:23 [async_llm.py:261] Added request cmpl-e3b4788dacd049c69ec9951e0aec6a2a-0. INFO 03-01 18:52:24 [logger.py:42] Received request cmpl-49d8ae0bc7e64ff69f8b2915e4312a9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:24 [async_llm.py:261] Added request cmpl-49d8ae0bc7e64ff69f8b2915e4312a9c-0. INFO 03-01 18:52:25 [logger.py:42] Received request cmpl-7f61af53510b493286f1a6f513282a60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:25 [async_llm.py:261] Added request cmpl-7f61af53510b493286f1a6f513282a60-0. INFO 03-01 18:52:27 [logger.py:42] Received request cmpl-cf43d36c4eab44b994bd8cd1065dec47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:27 [async_llm.py:261] Added request cmpl-cf43d36c4eab44b994bd8cd1065dec47-0. INFO 03-01 18:52:28 [logger.py:42] Received request cmpl-d04ea41561934f5ab4ee88159ba11867-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:28 [async_llm.py:261] Added request cmpl-d04ea41561934f5ab4ee88159ba11867-0. INFO 03-01 18:52:29 [logger.py:42] Received request cmpl-6de15140bbfa4ddd8eb920c81b11418b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:29 [async_llm.py:261] Added request cmpl-6de15140bbfa4ddd8eb920c81b11418b-0. INFO 03-01 18:52:30 [logger.py:42] Received request cmpl-0d5fde54932e4d40915ef8e9ae31bacf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:30 [async_llm.py:261] Added request cmpl-0d5fde54932e4d40915ef8e9ae31bacf-0. INFO 03-01 18:52:31 [logger.py:42] Received request cmpl-bbbaaddec4d3487c80ded2ee7952b2bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:31 [async_llm.py:261] Added request cmpl-bbbaaddec4d3487c80ded2ee7952b2bd-0. INFO 03-01 18:52:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:52:32 [logger.py:42] Received request cmpl-0e63378a47eb4a979f3f1f5663e6cd84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:32 [async_llm.py:261] Added request cmpl-0e63378a47eb4a979f3f1f5663e6cd84-0. INFO 03-01 18:52:34 [logger.py:42] Received request cmpl-b870b8a72163462f836e5bd1910a4783-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:34 [async_llm.py:261] Added request cmpl-b870b8a72163462f836e5bd1910a4783-0. INFO 03-01 18:52:35 [logger.py:42] Received request cmpl-fd2fa8277f16424d8b43b362b6ce770a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:35 [async_llm.py:261] Added request cmpl-fd2fa8277f16424d8b43b362b6ce770a-0. INFO 03-01 18:52:36 [logger.py:42] Received request cmpl-a2256043416f4c93b3c011c41c7cba00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:36 [async_llm.py:261] Added request cmpl-a2256043416f4c93b3c011c41c7cba00-0. INFO 03-01 18:52:37 [logger.py:42] Received request cmpl-d47e0dfc95a84692aa600da046061ec4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:37 [async_llm.py:261] Added request cmpl-d47e0dfc95a84692aa600da046061ec4-0. INFO 03-01 18:52:38 [logger.py:42] Received request cmpl-dbcb8f1fa31e4fddb9e39dd235547ee6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:38 [async_llm.py:261] Added request cmpl-dbcb8f1fa31e4fddb9e39dd235547ee6-0. INFO 03-01 18:52:39 [logger.py:42] Received request cmpl-8061aefb580a4480abc988c7be92736a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:39 [async_llm.py:261] Added request cmpl-8061aefb580a4480abc988c7be92736a-0. INFO 03-01 18:52:41 [logger.py:42] Received request cmpl-fa620eec6ad94742ac0a66b7dc8c3c3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:41 [async_llm.py:261] Added request cmpl-fa620eec6ad94742ac0a66b7dc8c3c3e-0. INFO 03-01 18:52:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:52:42 [logger.py:42] Received request cmpl-e8466f529c9545fa8755dfb0a2e3cc6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:42 [async_llm.py:261] Added request cmpl-e8466f529c9545fa8755dfb0a2e3cc6a-0. INFO 03-01 18:52:43 [logger.py:42] Received request cmpl-1942e7bea5ee4659a965592f669f921b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:43 [async_llm.py:261] Added request cmpl-1942e7bea5ee4659a965592f669f921b-0. INFO 03-01 18:52:44 [logger.py:42] Received request cmpl-278bbf88d7c1426c8764865550277a98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:44 [async_llm.py:261] Added request cmpl-278bbf88d7c1426c8764865550277a98-0. INFO 03-01 18:52:45 [logger.py:42] Received request cmpl-de25f17fe650415285b90ca9a086696c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:45 [async_llm.py:261] Added request cmpl-de25f17fe650415285b90ca9a086696c-0. INFO 03-01 18:52:46 [logger.py:42] Received request cmpl-9a4646b990dc478b8208102e8d8d954f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:46 [async_llm.py:261] Added request cmpl-9a4646b990dc478b8208102e8d8d954f-0. INFO 03-01 18:52:48 [logger.py:42] Received request cmpl-d01757ce3f7a401ea537473892da2eb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:48 [async_llm.py:261] Added request cmpl-d01757ce3f7a401ea537473892da2eb5-0. INFO 03-01 18:52:49 [logger.py:42] Received request cmpl-405513e54d9c4b639af491fbc8980c34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:49 [async_llm.py:261] Added request cmpl-405513e54d9c4b639af491fbc8980c34-0. INFO 03-01 18:52:50 [logger.py:42] Received request cmpl-f5d54467f67e4c1c963188cd5a2d387e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:50 [async_llm.py:261] Added request cmpl-f5d54467f67e4c1c963188cd5a2d387e-0. INFO 03-01 18:52:51 [logger.py:42] Received request cmpl-f1d0e6744efd4e04a14a7fedf0a305d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:51 [async_llm.py:261] Added request cmpl-f1d0e6744efd4e04a14a7fedf0a305d7-0. INFO 03-01 18:52:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:52:52 [logger.py:42] Received request cmpl-175c8c9f931740c98cd40fb195e5f30c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:52 [async_llm.py:261] Added request cmpl-175c8c9f931740c98cd40fb195e5f30c-0. INFO 03-01 18:52:53 [logger.py:42] Received request cmpl-f740b9c2aaec497c81d3038c0ccea4da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:53 [async_llm.py:261] Added request cmpl-f740b9c2aaec497c81d3038c0ccea4da-0. INFO 03-01 18:52:54 [logger.py:42] Received request cmpl-b3eb96875d514b00bd67e515fa0cf8b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:54 [async_llm.py:261] Added request cmpl-b3eb96875d514b00bd67e515fa0cf8b6-0. INFO 03-01 18:52:56 [logger.py:42] Received request cmpl-60930ca21c2147caa299bfea6022ab15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:56 [async_llm.py:261] Added request cmpl-60930ca21c2147caa299bfea6022ab15-0. INFO 03-01 18:52:57 [logger.py:42] Received request cmpl-da51c1c6a77a4511bf6025df3dc00800-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:57 [async_llm.py:261] Added request cmpl-da51c1c6a77a4511bf6025df3dc00800-0. INFO 03-01 18:52:58 [logger.py:42] Received request cmpl-0cf580f4a65e42d485a7455f8d95cc46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:58 [async_llm.py:261] Added request cmpl-0cf580f4a65e42d485a7455f8d95cc46-0. INFO 03-01 18:52:59 [logger.py:42] Received request cmpl-55adf51dd2df417cb4e8081d492a64d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:52:59 [async_llm.py:261] Added request cmpl-55adf51dd2df417cb4e8081d492a64d8-0. INFO 03-01 18:53:00 [logger.py:42] Received request cmpl-13abdb8a18d347f9bc4ffa40798e0eb6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:00 [async_llm.py:261] Added request cmpl-13abdb8a18d347f9bc4ffa40798e0eb6-0. INFO 03-01 18:53:01 [logger.py:42] Received request cmpl-c5b8f28092034a2bbe99733c4382a293-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:01 [async_llm.py:261] Added request cmpl-c5b8f28092034a2bbe99733c4382a293-0. INFO 03-01 18:53:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 18:53:03 [logger.py:42] Received request cmpl-8333360db7824f72b3dfc41d4b84e1a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:03 [async_llm.py:261] Added request cmpl-8333360db7824f72b3dfc41d4b84e1a0-0. INFO 03-01 18:53:04 [logger.py:42] Received request cmpl-235668426e824396a829ce3fb1064685-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:04 [async_llm.py:261] Added request cmpl-235668426e824396a829ce3fb1064685-0. INFO 03-01 18:53:05 [logger.py:42] Received request cmpl-d66307c7b0d34a198ce9d88a38f89b0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:05 [async_llm.py:261] Added request cmpl-d66307c7b0d34a198ce9d88a38f89b0b-0. INFO 03-01 18:53:06 [logger.py:42] Received request cmpl-37b8dd9f29d64ab6a5f1fe7e81f87968-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:06 [async_llm.py:261] Added request cmpl-37b8dd9f29d64ab6a5f1fe7e81f87968-0. INFO 03-01 18:53:07 [logger.py:42] Received request cmpl-c5481dd32e37430fbc66397abe22eeb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:07 [async_llm.py:261] Added request cmpl-c5481dd32e37430fbc66397abe22eeb3-0. INFO 03-01 18:53:08 [logger.py:42] Received request cmpl-4ed1353b1d764bd38430f7dbe9334d84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:08 [async_llm.py:261] Added request cmpl-4ed1353b1d764bd38430f7dbe9334d84-0. INFO 03-01 18:53:10 [logger.py:42] Received request cmpl-831db58b6eb94746ab684228b8893500-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:10 [async_llm.py:261] Added request cmpl-831db58b6eb94746ab684228b8893500-0. INFO 03-01 18:53:11 [logger.py:42] Received request cmpl-9245aa97dbb64bcb8cbf0714a371570e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:11 [async_llm.py:261] Added request cmpl-9245aa97dbb64bcb8cbf0714a371570e-0. INFO 03-01 18:53:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:53:12 [logger.py:42] Received request cmpl-4273f8d50ee943fd856961beae8fc930-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:12 [async_llm.py:261] Added request cmpl-4273f8d50ee943fd856961beae8fc930-0. INFO 03-01 18:53:13 [logger.py:42] Received request cmpl-4cf175e02f2d4ed0a1865edb8726e984-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:13 [async_llm.py:261] Added request cmpl-4cf175e02f2d4ed0a1865edb8726e984-0. INFO 03-01 18:53:14 [logger.py:42] Received request cmpl-1f114e566b084278852e54f69e2108f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:14 [async_llm.py:261] Added request cmpl-1f114e566b084278852e54f69e2108f9-0. INFO 03-01 18:53:15 [logger.py:42] Received request cmpl-4d466a78a92b4dbda17a24bde46a7424-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:15 [async_llm.py:261] Added request cmpl-4d466a78a92b4dbda17a24bde46a7424-0. INFO 03-01 18:53:17 [logger.py:42] Received request cmpl-20cb9e18411e4966ab9ec5fe24b6810d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:17 [async_llm.py:261] Added request cmpl-20cb9e18411e4966ab9ec5fe24b6810d-0. INFO 03-01 18:53:18 [logger.py:42] Received request cmpl-29ffeda438e74db592fcec80095f5ab5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:18 [async_llm.py:261] Added request cmpl-29ffeda438e74db592fcec80095f5ab5-0. INFO 03-01 18:53:19 [logger.py:42] Received request cmpl-bb3680d66fcb46fa91ba807513c36085-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:19 [async_llm.py:261] Added request cmpl-bb3680d66fcb46fa91ba807513c36085-0. INFO 03-01 18:53:20 [logger.py:42] Received request cmpl-04cf544fb9a542d387e7795befa2fe36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:20 [async_llm.py:261] Added request cmpl-04cf544fb9a542d387e7795befa2fe36-0. INFO 03-01 18:53:21 [logger.py:42] Received request cmpl-2f1af5978895431e98add54c37e1f938-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:21 [async_llm.py:261] Added request cmpl-2f1af5978895431e98add54c37e1f938-0. INFO 03-01 18:53:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:53:22 [logger.py:42] Received request cmpl-1f2f1e77ef144f50bad92a87cb8cde99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:22 [async_llm.py:261] Added request cmpl-1f2f1e77ef144f50bad92a87cb8cde99-0. INFO 03-01 18:53:23 [logger.py:42] Received request cmpl-b8c81ce27a0148499d7cd4cb8aafaf68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:23 [async_llm.py:261] Added request cmpl-b8c81ce27a0148499d7cd4cb8aafaf68-0. INFO 03-01 18:53:25 [logger.py:42] Received request cmpl-f25933dcb91647f7ac62ed1dc5958397-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:25 [async_llm.py:261] Added request cmpl-f25933dcb91647f7ac62ed1dc5958397-0. INFO 03-01 18:53:26 [logger.py:42] Received request cmpl-9153262529eb41829f8b3363b45af359-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:26 [async_llm.py:261] Added request cmpl-9153262529eb41829f8b3363b45af359-0. INFO 03-01 18:53:27 [logger.py:42] Received request cmpl-30c624ca46964ab793717ad9de22b7a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:27 [async_llm.py:261] Added request cmpl-30c624ca46964ab793717ad9de22b7a9-0. INFO 03-01 18:53:28 [logger.py:42] Received request cmpl-90bcb88e26b2483885ef78df8bc38f61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:28 [async_llm.py:261] Added request cmpl-90bcb88e26b2483885ef78df8bc38f61-0. INFO 03-01 18:53:29 [logger.py:42] Received request cmpl-7907ecf0f4c54822bf09bc8cfe4886e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:29 [async_llm.py:261] Added request cmpl-7907ecf0f4c54822bf09bc8cfe4886e9-0. INFO 03-01 18:53:30 [logger.py:42] Received request cmpl-d3e4c248ba984cc798cb6ced50ef223b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:30 [async_llm.py:261] Added request cmpl-d3e4c248ba984cc798cb6ced50ef223b-0. INFO 03-01 18:53:32 [logger.py:42] Received request cmpl-d17e86d75ea744dc9bbf2e9758dccd44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:32 [async_llm.py:261] Added request cmpl-d17e86d75ea744dc9bbf2e9758dccd44-0. INFO 03-01 18:53:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:53:33 [logger.py:42] Received request cmpl-ed3b2f85db174f4e9b8db3c588a1f58e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:33 [async_llm.py:261] Added request cmpl-ed3b2f85db174f4e9b8db3c588a1f58e-0. INFO 03-01 18:53:34 [logger.py:42] Received request cmpl-d9fe71d27c3b4f009d24bb2ea3e71648-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:34 [async_llm.py:261] Added request cmpl-d9fe71d27c3b4f009d24bb2ea3e71648-0. INFO 03-01 18:53:35 [logger.py:42] Received request cmpl-3c1d84233fce42f6bfa8e8cab3ee867d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:35 [async_llm.py:261] Added request cmpl-3c1d84233fce42f6bfa8e8cab3ee867d-0. INFO 03-01 18:53:36 [logger.py:42] Received request cmpl-76be615893eb48dbab82414c92c03573-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:36 [async_llm.py:261] Added request cmpl-76be615893eb48dbab82414c92c03573-0. INFO 03-01 18:53:37 [logger.py:42] Received request cmpl-9822a9335b224b77b32d3a6b5f6dbfb5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:37 [async_llm.py:261] Added request cmpl-9822a9335b224b77b32d3a6b5f6dbfb5-0. INFO 03-01 18:53:39 [logger.py:42] Received request cmpl-9c15da42ba624f76a0321bf28baa1eef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:39 [async_llm.py:261] Added request cmpl-9c15da42ba624f76a0321bf28baa1eef-0. INFO 03-01 18:53:40 [logger.py:42] Received request cmpl-08a9c7e9ea544c66b976e1011b751c97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:40 [async_llm.py:261] Added request cmpl-08a9c7e9ea544c66b976e1011b751c97-0. INFO 03-01 18:53:41 [logger.py:42] Received request cmpl-95c56189b041433e9f4c96d02898f598-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:41 [async_llm.py:261] Added request cmpl-95c56189b041433e9f4c96d02898f598-0. INFO 03-01 18:53:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:53:42 [logger.py:42] Received request cmpl-13e102350770439f9975602aaef89702-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:42 [async_llm.py:261] Added request cmpl-13e102350770439f9975602aaef89702-0. INFO 03-01 18:53:43 [logger.py:42] Received request cmpl-96efec09b8b0431688e733bff74d9c1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:43 [async_llm.py:261] Added request cmpl-96efec09b8b0431688e733bff74d9c1a-0. INFO 03-01 18:53:44 [logger.py:42] Received request cmpl-dece7211346a4411a9c0ff815679176a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:44 [async_llm.py:261] Added request cmpl-dece7211346a4411a9c0ff815679176a-0. INFO 03-01 18:53:45 [logger.py:42] Received request cmpl-5a608e2e2c3343189ee4db7f561cd8e9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:45 [async_llm.py:261] Added request cmpl-5a608e2e2c3343189ee4db7f561cd8e9-0. INFO 03-01 18:53:47 [logger.py:42] Received request cmpl-63ce8fdd34fa4bea845c185e3c2dea6a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:47 [async_llm.py:261] Added request cmpl-63ce8fdd34fa4bea845c185e3c2dea6a-0. INFO 03-01 18:53:48 [logger.py:42] Received request cmpl-0a047ba2097c48be938f7ec5ee1c3af4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:48 [async_llm.py:261] Added request cmpl-0a047ba2097c48be938f7ec5ee1c3af4-0. INFO 03-01 18:53:49 [logger.py:42] Received request cmpl-5ff98b75d8934785922c7b4f14d979d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:49 [async_llm.py:261] Added request cmpl-5ff98b75d8934785922c7b4f14d979d9-0. INFO 03-01 18:53:50 [logger.py:42] Received request cmpl-6f5a2b3439cb407192ab5d342d923908-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:50 [async_llm.py:261] Added request cmpl-6f5a2b3439cb407192ab5d342d923908-0. INFO 03-01 18:53:51 [logger.py:42] Received request cmpl-4b9cdddcb53f44158ce5fd2a7ef1a1b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:51 [async_llm.py:261] Added request cmpl-4b9cdddcb53f44158ce5fd2a7ef1a1b7-0. INFO 03-01 18:53:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:53:52 [logger.py:42] Received request cmpl-6805d4d798684ae88960c6e1893561a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:52 [async_llm.py:261] Added request cmpl-6805d4d798684ae88960c6e1893561a2-0. INFO 03-01 18:53:54 [logger.py:42] Received request cmpl-33a9f293a5ea4360a561ab30648b4062-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:54 [async_llm.py:261] Added request cmpl-33a9f293a5ea4360a561ab30648b4062-0. INFO 03-01 18:53:55 [logger.py:42] Received request cmpl-4e9fc4fe6f9545b7a44ff1e20c5e1f6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:55 [async_llm.py:261] Added request cmpl-4e9fc4fe6f9545b7a44ff1e20c5e1f6d-0. INFO 03-01 18:53:56 [logger.py:42] Received request cmpl-653a40384a574b94908a60a7c4f054d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:56 [async_llm.py:261] Added request cmpl-653a40384a574b94908a60a7c4f054d3-0. INFO 03-01 18:53:57 [logger.py:42] Received request cmpl-c5225eb72d1b4d3abfd7b28fdfcc7aa9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:57 [async_llm.py:261] Added request cmpl-c5225eb72d1b4d3abfd7b28fdfcc7aa9-0. INFO 03-01 18:53:58 [logger.py:42] Received request cmpl-a561c5eec7e34a6a8e20f6492204c897-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:58 [async_llm.py:261] Added request cmpl-a561c5eec7e34a6a8e20f6492204c897-0. INFO 03-01 18:53:59 [logger.py:42] Received request cmpl-d25a57b36fe74de0adb85459de072288-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:53:59 [async_llm.py:261] Added request cmpl-d25a57b36fe74de0adb85459de072288-0. INFO 03-01 18:54:01 [logger.py:42] Received request cmpl-9a8964c6bb88474ea2b8307e5a0d1d31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:01 [async_llm.py:261] Added request cmpl-9a8964c6bb88474ea2b8307e5a0d1d31-0. INFO 03-01 18:54:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:54:02 [logger.py:42] Received request cmpl-410604162b7b4ba6be86297a41e50305-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:02 [async_llm.py:261] Added request cmpl-410604162b7b4ba6be86297a41e50305-0. INFO 03-01 18:54:03 [logger.py:42] Received request cmpl-be2f9dcea0af45e8b9e6dede398396bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:03 [async_llm.py:261] Added request cmpl-be2f9dcea0af45e8b9e6dede398396bf-0. INFO 03-01 18:54:04 [logger.py:42] Received request cmpl-0155c0fe4a864942910ec8a19b630f65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:04 [async_llm.py:261] Added request cmpl-0155c0fe4a864942910ec8a19b630f65-0. INFO 03-01 18:54:05 [logger.py:42] Received request cmpl-cdeb139b76db4169886fb37e838a6366-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:05 [async_llm.py:261] Added request cmpl-cdeb139b76db4169886fb37e838a6366-0. INFO 03-01 18:54:06 [logger.py:42] Received request cmpl-95f66333f9b3464488d92594b376820b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:06 [async_llm.py:261] Added request cmpl-95f66333f9b3464488d92594b376820b-0. INFO 03-01 18:54:08 [logger.py:42] Received request cmpl-f9077614e424446c87d126fc588a6f0d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:08 [async_llm.py:261] Added request cmpl-f9077614e424446c87d126fc588a6f0d-0. INFO 03-01 18:54:09 [logger.py:42] Received request cmpl-222d98b744dc47f993ead94b0a48c7c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:09 [async_llm.py:261] Added request cmpl-222d98b744dc47f993ead94b0a48c7c4-0. INFO 03-01 18:54:10 [logger.py:42] Received request cmpl-34ac793f2c224924bc49d4626ea9c0e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:10 [async_llm.py:261] Added request cmpl-34ac793f2c224924bc49d4626ea9c0e6-0. INFO 03-01 18:54:11 [logger.py:42] Received request cmpl-a5490963155b428080c7dc3af3b99b04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:11 [async_llm.py:261] Added request cmpl-a5490963155b428080c7dc3af3b99b04-0. INFO 03-01 18:54:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:54:12 [logger.py:42] Received request cmpl-b1b7a4a7d5fe4268ac1b049fd095f356-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:12 [async_llm.py:261] Added request cmpl-b1b7a4a7d5fe4268ac1b049fd095f356-0. INFO 03-01 18:54:14 [logger.py:42] Received request cmpl-8e7e9dabeead435d93f854ff9c194d21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:14 [async_llm.py:261] Added request cmpl-8e7e9dabeead435d93f854ff9c194d21-0. INFO 03-01 18:54:15 [logger.py:42] Received request cmpl-c9227734d621464bb1e1c7f900e62e5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:15 [async_llm.py:261] Added request cmpl-c9227734d621464bb1e1c7f900e62e5c-0. INFO 03-01 18:54:16 [logger.py:42] Received request cmpl-0e96247115d84b09861da331ac5cd9ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:16 [async_llm.py:261] Added request cmpl-0e96247115d84b09861da331ac5cd9ab-0. INFO 03-01 18:54:17 [logger.py:42] Received request cmpl-58ee893955954a55a93cbb286f6e0456-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:17 [async_llm.py:261] Added request cmpl-58ee893955954a55a93cbb286f6e0456-0. INFO 03-01 18:54:18 [logger.py:42] Received request cmpl-644ec8a631ca485a85f6f2deb3290d01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:18 [async_llm.py:261] Added request cmpl-644ec8a631ca485a85f6f2deb3290d01-0. INFO 03-01 18:54:19 [logger.py:42] Received request cmpl-57c807cf034442499697b328fac107b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:19 [async_llm.py:261] Added request cmpl-57c807cf034442499697b328fac107b0-0. INFO 03-01 18:54:21 [logger.py:42] Received request cmpl-0dbe93821920472a9ccbba1dc6570501-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:21 [async_llm.py:261] Added request cmpl-0dbe93821920472a9ccbba1dc6570501-0. INFO 03-01 18:54:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:54:22 [logger.py:42] Received request cmpl-84c5f88067cb43a6a645eae81a7a931d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:22 [async_llm.py:261] Added request cmpl-84c5f88067cb43a6a645eae81a7a931d-0. INFO 03-01 18:54:23 [logger.py:42] Received request cmpl-3cd5a907d9a24b6298cec2b47045af05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:23 [async_llm.py:261] Added request cmpl-3cd5a907d9a24b6298cec2b47045af05-0. INFO 03-01 18:54:24 [logger.py:42] Received request cmpl-04be634d7b3d4ef3aa7a12b3b737c07f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:24 [async_llm.py:261] Added request cmpl-04be634d7b3d4ef3aa7a12b3b737c07f-0. INFO 03-01 18:54:25 [logger.py:42] Received request cmpl-7f35cf8529b741f88c050d0a93284f62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:25 [async_llm.py:261] Added request cmpl-7f35cf8529b741f88c050d0a93284f62-0. INFO 03-01 18:54:26 [logger.py:42] Received request cmpl-81f7181e32fc4e34b453286db55bd0d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:26 [async_llm.py:261] Added request cmpl-81f7181e32fc4e34b453286db55bd0d3-0. INFO 03-01 18:54:28 [logger.py:42] Received request cmpl-0b4e17eee26f4d69b8a9637ded31d36f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:28 [async_llm.py:261] Added request cmpl-0b4e17eee26f4d69b8a9637ded31d36f-0. INFO 03-01 18:54:29 [logger.py:42] Received request cmpl-b9c9168e3b5c449d90db1dd47ce85cce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:29 [async_llm.py:261] Added request cmpl-b9c9168e3b5c449d90db1dd47ce85cce-0. INFO 03-01 18:54:30 [logger.py:42] Received request cmpl-bd1c85204e104c9fb073751f8732ad49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:30 [async_llm.py:261] Added request cmpl-bd1c85204e104c9fb073751f8732ad49-0. INFO 03-01 18:54:31 [logger.py:42] Received request cmpl-19bd6e3e76314d9d896a746090fac478-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:31 [async_llm.py:261] Added request cmpl-19bd6e3e76314d9d896a746090fac478-0. INFO 03-01 18:54:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:54:32 [logger.py:42] Received request cmpl-cff3f7fc8e8c4ee0aea9c5d5b437ab98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:32 [async_llm.py:261] Added request cmpl-cff3f7fc8e8c4ee0aea9c5d5b437ab98-0. INFO 03-01 18:54:33 [logger.py:42] Received request cmpl-97ed89e851a54885b6b2dcdb17623a9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:33 [async_llm.py:261] Added request cmpl-97ed89e851a54885b6b2dcdb17623a9d-0. INFO 03-01 18:54:35 [logger.py:42] Received request cmpl-eff3b025a1fb493b947ec0b4be0e4329-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:35 [async_llm.py:261] Added request cmpl-eff3b025a1fb493b947ec0b4be0e4329-0. INFO 03-01 18:54:36 [logger.py:42] Received request cmpl-a7ae691d32a04b6bac012d189a1718f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:36 [async_llm.py:261] Added request cmpl-a7ae691d32a04b6bac012d189a1718f7-0. INFO 03-01 18:54:37 [logger.py:42] Received request cmpl-9f3959c69a2b40b893b954a6cd151f8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:37 [async_llm.py:261] Added request cmpl-9f3959c69a2b40b893b954a6cd151f8e-0. INFO 03-01 18:54:38 [logger.py:42] Received request cmpl-01be1027a3bf4c608b0d76dc0d472cd4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:38 [async_llm.py:261] Added request cmpl-01be1027a3bf4c608b0d76dc0d472cd4-0. INFO 03-01 18:54:39 [logger.py:42] Received request cmpl-585a34d599934616982a8bf016abb862-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:39 [async_llm.py:261] Added request cmpl-585a34d599934616982a8bf016abb862-0. INFO 03-01 18:54:41 [logger.py:42] Received request cmpl-181743d9a26c462db87b4b4f63f598e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:41 [async_llm.py:261] Added request cmpl-181743d9a26c462db87b4b4f63f598e0-0. INFO 03-01 18:54:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:54:42 [logger.py:42] Received request cmpl-b927261427464df3a67d9128011a4f97-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:42 [async_llm.py:261] Added request cmpl-b927261427464df3a67d9128011a4f97-0. INFO 03-01 18:54:43 [logger.py:42] Received request cmpl-62c1374d9dbf47349ed2e8a8a7713938-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:43 [async_llm.py:261] Added request cmpl-62c1374d9dbf47349ed2e8a8a7713938-0. INFO 03-01 18:54:44 [logger.py:42] Received request cmpl-8e9d91b476d4409ca93892fbd73c83c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:44 [async_llm.py:261] Added request cmpl-8e9d91b476d4409ca93892fbd73c83c7-0. INFO 03-01 18:54:45 [logger.py:42] Received request cmpl-aac657480c3c4379874b6159217264c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:45 [async_llm.py:261] Added request cmpl-aac657480c3c4379874b6159217264c2-0. INFO 03-01 18:54:47 [logger.py:42] Received request cmpl-56ca0e458fb84dd4ac0533715523d74c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:47 [async_llm.py:261] Added request cmpl-56ca0e458fb84dd4ac0533715523d74c-0. INFO 03-01 18:54:48 [logger.py:42] Received request cmpl-836ffce9d9d343b38366b2011cc446fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:48 [async_llm.py:261] Added request cmpl-836ffce9d9d343b38366b2011cc446fb-0. INFO 03-01 18:54:49 [logger.py:42] Received request cmpl-dfdc4e116aeb41568b8efc181ee9c760-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:49 [async_llm.py:261] Added request cmpl-dfdc4e116aeb41568b8efc181ee9c760-0. INFO 03-01 18:54:50 [logger.py:42] Received request cmpl-879ab56731044884a2ce574d3e636417-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:50 [async_llm.py:261] Added request cmpl-879ab56731044884a2ce574d3e636417-0. INFO 03-01 18:54:51 [logger.py:42] Received request cmpl-1d60cd74d43846d586345e3eccdc6a27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:51 [async_llm.py:261] Added request cmpl-1d60cd74d43846d586345e3eccdc6a27-0. INFO 03-01 18:54:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:54:52 [logger.py:42] Received request cmpl-890036f32cb5429989ced70139436076-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:52 [async_llm.py:261] Added request cmpl-890036f32cb5429989ced70139436076-0. INFO 03-01 18:54:54 [logger.py:42] Received request cmpl-add0e3fcb0364c65a70fbb6dc0084c56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:54 [async_llm.py:261] Added request cmpl-add0e3fcb0364c65a70fbb6dc0084c56-0. INFO 03-01 18:54:55 [logger.py:42] Received request cmpl-a9c7eb62f45c4776b96707d5ec8379df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:55 [async_llm.py:261] Added request cmpl-a9c7eb62f45c4776b96707d5ec8379df-0. INFO 03-01 18:54:56 [logger.py:42] Received request cmpl-2f8a5f70fd17405eb0ff5504ad4cb5f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:56 [async_llm.py:261] Added request cmpl-2f8a5f70fd17405eb0ff5504ad4cb5f5-0. INFO 03-01 18:54:57 [logger.py:42] Received request cmpl-b1657abec9854c9aa7aac978a3c3802a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:57 [async_llm.py:261] Added request cmpl-b1657abec9854c9aa7aac978a3c3802a-0. INFO 03-01 18:54:58 [logger.py:42] Received request cmpl-387c02bc927e4a8fb40a95386016d5a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:58 [async_llm.py:261] Added request cmpl-387c02bc927e4a8fb40a95386016d5a7-0. INFO 03-01 18:54:59 [logger.py:42] Received request cmpl-423531b937024180a20dd1910d73f37d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:54:59 [async_llm.py:261] Added request cmpl-423531b937024180a20dd1910d73f37d-0. INFO 03-01 18:55:01 [logger.py:42] Received request cmpl-32b03dc67b8749d487e411bc0c6e2a48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:01 [async_llm.py:261] Added request cmpl-32b03dc67b8749d487e411bc0c6e2a48-0. INFO 03-01 18:55:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:55:02 [logger.py:42] Received request cmpl-ee10db81115b4d0fb05664538c27e8e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:02 [async_llm.py:261] Added request cmpl-ee10db81115b4d0fb05664538c27e8e7-0. INFO 03-01 18:55:03 [logger.py:42] Received request cmpl-0c6ffcf38db942bb8019a2fb19bd5fa5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:03 [async_llm.py:261] Added request cmpl-0c6ffcf38db942bb8019a2fb19bd5fa5-0. INFO 03-01 18:55:04 [logger.py:42] Received request cmpl-8f9803c633214fb9b4e9f5fbcd2f8080-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:04 [async_llm.py:261] Added request cmpl-8f9803c633214fb9b4e9f5fbcd2f8080-0. INFO 03-01 18:55:05 [logger.py:42] Received request cmpl-9c4d27f6b7654d62b29f653b4d6ddd11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:05 [async_llm.py:261] Added request cmpl-9c4d27f6b7654d62b29f653b4d6ddd11-0. INFO 03-01 18:55:06 [logger.py:42] Received request cmpl-8e3e4534a22b4b5eab0ce7ea3c21348c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:06 [async_llm.py:261] Added request cmpl-8e3e4534a22b4b5eab0ce7ea3c21348c-0. INFO 03-01 18:55:07 [logger.py:42] Received request cmpl-ba654cc76acf4f7196086f910155f9a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:07 [async_llm.py:261] Added request cmpl-ba654cc76acf4f7196086f910155f9a5-0. INFO 03-01 18:55:09 [logger.py:42] Received request cmpl-aeb0946dc2bb4b3db3394fd9143b5741-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:09 [async_llm.py:261] Added request cmpl-aeb0946dc2bb4b3db3394fd9143b5741-0. INFO 03-01 18:55:10 [logger.py:42] Received request cmpl-69a0c84fe03e4c94bd6bcd0078891962-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:10 [async_llm.py:261] Added request cmpl-69a0c84fe03e4c94bd6bcd0078891962-0. INFO 03-01 18:55:11 [logger.py:42] Received request cmpl-7b317a6920c44321aa6c957af17f391b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:11 [async_llm.py:261] Added request cmpl-7b317a6920c44321aa6c957af17f391b-0. INFO 03-01 18:55:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:55:12 [logger.py:42] Received request cmpl-acb66f2be05041c39b439c9ad19afba5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:12 [async_llm.py:261] Added request cmpl-acb66f2be05041c39b439c9ad19afba5-0. INFO 03-01 18:55:13 [logger.py:42] Received request cmpl-059405b2b4a644279fc394058180a6ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:13 [async_llm.py:261] Added request cmpl-059405b2b4a644279fc394058180a6ae-0. INFO 03-01 18:55:14 [logger.py:42] Received request cmpl-8069c0b5f17f46128a7e320e7f1a6ab3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:14 [async_llm.py:261] Added request cmpl-8069c0b5f17f46128a7e320e7f1a6ab3-0. INFO 03-01 18:55:16 [logger.py:42] Received request cmpl-d23dce33daa243a29696a089f8018167-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:16 [async_llm.py:261] Added request cmpl-d23dce33daa243a29696a089f8018167-0. INFO 03-01 18:55:17 [logger.py:42] Received request cmpl-5b9bdf33bb1b4db8880316c884a10ca5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:17 [async_llm.py:261] Added request cmpl-5b9bdf33bb1b4db8880316c884a10ca5-0. INFO 03-01 18:55:18 [logger.py:42] Received request cmpl-01bb8fd5360b434db815ec6e0713051d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:18 [async_llm.py:261] Added request cmpl-01bb8fd5360b434db815ec6e0713051d-0. INFO 03-01 18:55:19 [logger.py:42] Received request cmpl-7fe6956b0bc7445197b9e6357c077a60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:19 [async_llm.py:261] Added request cmpl-7fe6956b0bc7445197b9e6357c077a60-0. INFO 03-01 18:55:20 [logger.py:42] Received request cmpl-f774a91bc081417fb4555e32df99dfdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:20 [async_llm.py:261] Added request cmpl-f774a91bc081417fb4555e32df99dfdc-0. INFO 03-01 18:55:21 [logger.py:42] Received request cmpl-2da55210d57f432a8065c195089c1983-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:21 [async_llm.py:261] Added request cmpl-2da55210d57f432a8065c195089c1983-0. INFO 03-01 18:55:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:55:23 [logger.py:42] Received request cmpl-012c0d4bf8aa44bf831498a1f99079b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:23 [async_llm.py:261] Added request cmpl-012c0d4bf8aa44bf831498a1f99079b2-0. INFO 03-01 18:55:24 [logger.py:42] Received request cmpl-bfe78f45ccba4fa6ae1fdc9c44aabf30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:24 [async_llm.py:261] Added request cmpl-bfe78f45ccba4fa6ae1fdc9c44aabf30-0. INFO 03-01 18:55:25 [logger.py:42] Received request cmpl-0bd4a2406e434da8b7a88d9c6532ea8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:25 [async_llm.py:261] Added request cmpl-0bd4a2406e434da8b7a88d9c6532ea8d-0. INFO 03-01 18:55:26 [logger.py:42] Received request cmpl-b50be5859ece453e8a33b43f4623820f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:26 [async_llm.py:261] Added request cmpl-b50be5859ece453e8a33b43f4623820f-0. INFO 03-01 18:55:27 [logger.py:42] Received request cmpl-e431e7f55aa144af8bf179c2cd67587c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:27 [async_llm.py:261] Added request cmpl-e431e7f55aa144af8bf179c2cd67587c-0. INFO 03-01 18:55:28 [logger.py:42] Received request cmpl-26671026df0e43f29667a142e811e877-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:28 [async_llm.py:261] Added request cmpl-26671026df0e43f29667a142e811e877-0. INFO 03-01 18:55:30 [logger.py:42] Received request cmpl-ab9836d1f1c243cb98d4f87f3834775f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:30 [async_llm.py:261] Added request cmpl-ab9836d1f1c243cb98d4f87f3834775f-0. INFO 03-01 18:55:31 [logger.py:42] Received request cmpl-eacd295d3d154955857454ab74a3ebe1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:31 [async_llm.py:261] Added request cmpl-eacd295d3d154955857454ab74a3ebe1-0. INFO 03-01 18:55:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:55:32 [logger.py:42] Received request cmpl-1453d695d2974599815aafaf23993924-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:32 [async_llm.py:261] Added request cmpl-1453d695d2974599815aafaf23993924-0. INFO 03-01 18:55:33 [logger.py:42] Received request cmpl-6385507344ad47d994d46784cb3a9596-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:33 [async_llm.py:261] Added request cmpl-6385507344ad47d994d46784cb3a9596-0. INFO 03-01 18:55:34 [logger.py:42] Received request cmpl-ce34c9cef23f42b184e7c5dbcdd1d7ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:34 [async_llm.py:261] Added request cmpl-ce34c9cef23f42b184e7c5dbcdd1d7ff-0. INFO 03-01 18:55:35 [logger.py:42] Received request cmpl-43e686dd5a864ed8b1bea16b77634567-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:35 [async_llm.py:261] Added request cmpl-43e686dd5a864ed8b1bea16b77634567-0. INFO 03-01 18:55:37 [logger.py:42] Received request cmpl-336f93b12ed64ea1a7a373fc93a3ec49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:37 [async_llm.py:261] Added request cmpl-336f93b12ed64ea1a7a373fc93a3ec49-0. INFO 03-01 18:55:38 [logger.py:42] Received request cmpl-4060171fe7d441b7ba8adf9f00ef712e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:38 [async_llm.py:261] Added request cmpl-4060171fe7d441b7ba8adf9f00ef712e-0. INFO 03-01 18:55:39 [logger.py:42] Received request cmpl-adaca5b6dceb440fad44017537493a4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:39 [async_llm.py:261] Added request cmpl-adaca5b6dceb440fad44017537493a4a-0. INFO 03-01 18:55:40 [logger.py:42] Received request cmpl-6d80f4ec09994bf886e417133ad2afc2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:40 [async_llm.py:261] Added request cmpl-6d80f4ec09994bf886e417133ad2afc2-0. INFO 03-01 18:55:41 [logger.py:42] Received request cmpl-bb6a9b542eb34c99966d2f093dcd69ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:41 [async_llm.py:261] Added request cmpl-bb6a9b542eb34c99966d2f093dcd69ee-0. INFO 03-01 18:55:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:55:42 [logger.py:42] Received request cmpl-fee1c19c94604b33841a1e7fa4828797-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:42 [async_llm.py:261] Added request cmpl-fee1c19c94604b33841a1e7fa4828797-0. INFO 03-01 18:55:43 [logger.py:42] Received request cmpl-896356db6c494b72ba1ac4fd4a5139f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:43 [async_llm.py:261] Added request cmpl-896356db6c494b72ba1ac4fd4a5139f0-0. INFO 03-01 18:55:45 [logger.py:42] Received request cmpl-43d8f3e576734053aed8724601237fc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:45 [async_llm.py:261] Added request cmpl-43d8f3e576734053aed8724601237fc4-0. INFO 03-01 18:55:46 [logger.py:42] Received request cmpl-183f3cebfa77424cbc5c3c1d17f60ed3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:46 [async_llm.py:261] Added request cmpl-183f3cebfa77424cbc5c3c1d17f60ed3-0. INFO 03-01 18:55:47 [logger.py:42] Received request cmpl-a17e773245e44e5fab405fbbaf1ee36d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:47 [async_llm.py:261] Added request cmpl-a17e773245e44e5fab405fbbaf1ee36d-0. INFO 03-01 18:55:48 [logger.py:42] Received request cmpl-88fe70182c654fbe9e60aa3c424a4844-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:48 [async_llm.py:261] Added request cmpl-88fe70182c654fbe9e60aa3c424a4844-0. INFO 03-01 18:55:49 [logger.py:42] Received request cmpl-8a428e5c8f61450b896df358c19d44bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:49 [async_llm.py:261] Added request cmpl-8a428e5c8f61450b896df358c19d44bc-0. INFO 03-01 18:55:50 [logger.py:42] Received request cmpl-96cb9bb0f23249deaabbb3c09919e05f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:50 [async_llm.py:261] Added request cmpl-96cb9bb0f23249deaabbb3c09919e05f-0. INFO 03-01 18:55:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:55:52 [logger.py:42] Received request cmpl-256b788cf2294ecfb7013a4b1be5095b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:52 [async_llm.py:261] Added request cmpl-256b788cf2294ecfb7013a4b1be5095b-0. INFO 03-01 18:55:53 [logger.py:42] Received request cmpl-07a84eb0fff449ec894e77c9aabca66c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:53 [async_llm.py:261] Added request cmpl-07a84eb0fff449ec894e77c9aabca66c-0. INFO 03-01 18:55:54 [logger.py:42] Received request cmpl-5aa396d428af427c8a8fe1dc141f2b65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:54 [async_llm.py:261] Added request cmpl-5aa396d428af427c8a8fe1dc141f2b65-0. INFO 03-01 18:55:55 [logger.py:42] Received request cmpl-9be0274d11e64c598c776aa0c995be48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:55 [async_llm.py:261] Added request cmpl-9be0274d11e64c598c776aa0c995be48-0. INFO 03-01 18:55:56 [logger.py:42] Received request cmpl-e2e69322e21f4cc98951ee03ab68d0b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:56 [async_llm.py:261] Added request cmpl-e2e69322e21f4cc98951ee03ab68d0b6-0. INFO 03-01 18:55:57 [logger.py:42] Received request cmpl-9b4efbc7499649b0b0c35da7ef4b4173-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:57 [async_llm.py:261] Added request cmpl-9b4efbc7499649b0b0c35da7ef4b4173-0. INFO 03-01 18:55:59 [logger.py:42] Received request cmpl-5a5269277cd24b8d9a24e95a064b2ef3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:55:59 [async_llm.py:261] Added request cmpl-5a5269277cd24b8d9a24e95a064b2ef3-0. INFO 03-01 18:56:00 [logger.py:42] Received request cmpl-cbcede365f724716a4c7b466bf630c4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:00 [async_llm.py:261] Added request cmpl-cbcede365f724716a4c7b466bf630c4a-0. INFO 03-01 18:56:01 [logger.py:42] Received request cmpl-c6f9723913b644d7bcf27fe79e069aaf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:01 [async_llm.py:261] Added request cmpl-c6f9723913b644d7bcf27fe79e069aaf-0. INFO 03-01 18:56:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:56:02 [logger.py:42] Received request cmpl-eaf218f35f374f5db59c14298afc1450-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:02 [async_llm.py:261] Added request cmpl-eaf218f35f374f5db59c14298afc1450-0. INFO 03-01 18:56:03 [logger.py:42] Received request cmpl-19ee3a1bcb9344cc85ca3f329b637105-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:03 [async_llm.py:261] Added request cmpl-19ee3a1bcb9344cc85ca3f329b637105-0. INFO 03-01 18:56:04 [logger.py:42] Received request cmpl-598454bcdc2b45c3a158b057bb495f79-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:04 [async_llm.py:261] Added request cmpl-598454bcdc2b45c3a158b057bb495f79-0. INFO 03-01 18:56:06 [logger.py:42] Received request cmpl-1b66b62628a74e63b77cd430318d94c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:06 [async_llm.py:261] Added request cmpl-1b66b62628a74e63b77cd430318d94c9-0. INFO 03-01 18:56:07 [logger.py:42] Received request cmpl-e9f5e6d2bd23424f9d5fcbd0423ab5fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:07 [async_llm.py:261] Added request cmpl-e9f5e6d2bd23424f9d5fcbd0423ab5fa-0. INFO 03-01 18:56:08 [logger.py:42] Received request cmpl-019b3e0da2314a7da52bb2b15ba1272a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:08 [async_llm.py:261] Added request cmpl-019b3e0da2314a7da52bb2b15ba1272a-0. INFO 03-01 18:56:09 [logger.py:42] Received request cmpl-4f63055f324b4e04bcf3438610261525-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:09 [async_llm.py:261] Added request cmpl-4f63055f324b4e04bcf3438610261525-0. INFO 03-01 18:56:10 [logger.py:42] Received request cmpl-8737529b67824a3ca66f1a740a748b5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:10 [async_llm.py:261] Added request cmpl-8737529b67824a3ca66f1a740a748b5c-0. INFO 03-01 18:56:11 [logger.py:42] Received request cmpl-39fd7ac9ac5641cb8f9983cd54a2aec7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:11 [async_llm.py:261] Added request cmpl-39fd7ac9ac5641cb8f9983cd54a2aec7-0. INFO 03-01 18:56:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:56:12 [logger.py:42] Received request cmpl-18d5731118e840b1b403869e2051b87e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:12 [async_llm.py:261] Added request cmpl-18d5731118e840b1b403869e2051b87e-0. INFO 03-01 18:56:14 [logger.py:42] Received request cmpl-d612dd10bad940de912062aee2cce390-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:14 [async_llm.py:261] Added request cmpl-d612dd10bad940de912062aee2cce390-0. INFO 03-01 18:56:15 [logger.py:42] Received request cmpl-10e3be4fee464ded8c1003a3ab2c9bce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:15 [async_llm.py:261] Added request cmpl-10e3be4fee464ded8c1003a3ab2c9bce-0. INFO 03-01 18:56:16 [logger.py:42] Received request cmpl-f7fc48614a4b4baa8176c003389df573-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:16 [async_llm.py:261] Added request cmpl-f7fc48614a4b4baa8176c003389df573-0. INFO 03-01 18:56:17 [logger.py:42] Received request cmpl-c29faa60b9d04c168bab3c66e1da5b71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:17 [async_llm.py:261] Added request cmpl-c29faa60b9d04c168bab3c66e1da5b71-0. INFO 03-01 18:56:18 [logger.py:42] Received request cmpl-df3a3ddd1068478aa9927f795a959e3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:18 [async_llm.py:261] Added request cmpl-df3a3ddd1068478aa9927f795a959e3b-0. INFO 03-01 18:56:19 [logger.py:42] Received request cmpl-0febce8057d14939b4ef511afbc13679-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:19 [async_llm.py:261] Added request cmpl-0febce8057d14939b4ef511afbc13679-0. INFO 03-01 18:56:21 [logger.py:42] Received request cmpl-9d15e3b388834461a7ee09a7cc54b4d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:21 [async_llm.py:261] Added request cmpl-9d15e3b388834461a7ee09a7cc54b4d8-0. INFO 03-01 18:56:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:56:22 [logger.py:42] Received request cmpl-f874bed6b35341bfb38501b57e0a9ea6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:22 [async_llm.py:261] Added request cmpl-f874bed6b35341bfb38501b57e0a9ea6-0. INFO 03-01 18:56:23 [logger.py:42] Received request cmpl-1e65504211804b8c83704fdf57254cb4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:23 [async_llm.py:261] Added request cmpl-1e65504211804b8c83704fdf57254cb4-0. INFO 03-01 18:56:24 [logger.py:42] Received request cmpl-145b6636ed4a488aaeafd86a78a07cc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:24 [async_llm.py:261] Added request cmpl-145b6636ed4a488aaeafd86a78a07cc0-0. INFO 03-01 18:56:25 [logger.py:42] Received request cmpl-1dfbe31ddbdf44e581a856aa028a0708-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:25 [async_llm.py:261] Added request cmpl-1dfbe31ddbdf44e581a856aa028a0708-0. INFO 03-01 18:56:26 [logger.py:42] Received request cmpl-a1a1214e0914416cae845c629eb8d0ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:26 [async_llm.py:261] Added request cmpl-a1a1214e0914416cae845c629eb8d0ce-0. INFO 03-01 18:56:28 [logger.py:42] Received request cmpl-078928a0b6f240eda55ccd6180923002-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:28 [async_llm.py:261] Added request cmpl-078928a0b6f240eda55ccd6180923002-0. INFO 03-01 18:56:29 [logger.py:42] Received request cmpl-78eedf3e4a834fd3bedada6871cf200d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:29 [async_llm.py:261] Added request cmpl-78eedf3e4a834fd3bedada6871cf200d-0. INFO 03-01 18:56:30 [logger.py:42] Received request cmpl-73c41cd1fd774176b2af93e825706d74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:30 [async_llm.py:261] Added request cmpl-73c41cd1fd774176b2af93e825706d74-0. INFO 03-01 18:56:31 [logger.py:42] Received request cmpl-50ce5321383d402aa025564fe6fc49da-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:31 [async_llm.py:261] Added request cmpl-50ce5321383d402aa025564fe6fc49da-0. INFO 03-01 18:56:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:56:32 [logger.py:42] Received request cmpl-c54a88f868a94a2eba49085a785f2c5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:32 [async_llm.py:261] Added request cmpl-c54a88f868a94a2eba49085a785f2c5f-0. INFO 03-01 18:56:33 [logger.py:42] Received request cmpl-00c2692477bb42ebba51b51b89217cad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:33 [async_llm.py:261] Added request cmpl-00c2692477bb42ebba51b51b89217cad-0. INFO 03-01 18:56:35 [logger.py:42] Received request cmpl-ca41dc5d8c1a41e1931ee7dd572f6e9c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:35 [async_llm.py:261] Added request cmpl-ca41dc5d8c1a41e1931ee7dd572f6e9c-0. INFO 03-01 18:56:36 [logger.py:42] Received request cmpl-16e74a77619f4ceb88893c5d6172742f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:36 [async_llm.py:261] Added request cmpl-16e74a77619f4ceb88893c5d6172742f-0. INFO 03-01 18:56:37 [logger.py:42] Received request cmpl-05306a671e1546879c48c08269137e7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:37 [async_llm.py:261] Added request cmpl-05306a671e1546879c48c08269137e7b-0. INFO 03-01 18:56:38 [logger.py:42] Received request cmpl-bfcefd4dbde24faea122c002d9bf5dcd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:38 [async_llm.py:261] Added request cmpl-bfcefd4dbde24faea122c002d9bf5dcd-0. INFO 03-01 18:56:39 [logger.py:42] Received request cmpl-217a85fea22649879fe6f327182bf3af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:39 [async_llm.py:261] Added request cmpl-217a85fea22649879fe6f327182bf3af-0. INFO 03-01 18:56:40 [logger.py:42] Received request cmpl-1eb8c7bd3b5f4133b71f4a2f404a0c8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:40 [async_llm.py:261] Added request cmpl-1eb8c7bd3b5f4133b71f4a2f404a0c8d-0. INFO 03-01 18:56:41 [logger.py:42] Received request cmpl-73255429d01c43d298f4e8807234f416-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:41 [async_llm.py:261] Added request cmpl-73255429d01c43d298f4e8807234f416-0. INFO 03-01 18:56:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 18:56:43 [logger.py:42] Received request cmpl-76805be0e2ef408599accf7ce0276bf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:43 [async_llm.py:261] Added request cmpl-76805be0e2ef408599accf7ce0276bf6-0. INFO 03-01 18:56:44 [logger.py:42] Received request cmpl-d8d6f226122b4760a11ffe7481a689a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:44 [async_llm.py:261] Added request cmpl-d8d6f226122b4760a11ffe7481a689a5-0. INFO 03-01 18:56:45 [logger.py:42] Received request cmpl-4152f2be660b42028504b34c9d7ab395-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:45 [async_llm.py:261] Added request cmpl-4152f2be660b42028504b34c9d7ab395-0. INFO 03-01 18:56:46 [logger.py:42] Received request cmpl-7b3c92f0205b407795582bcd6dc2cf70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:46 [async_llm.py:261] Added request cmpl-7b3c92f0205b407795582bcd6dc2cf70-0. INFO 03-01 18:56:47 [logger.py:42] Received request cmpl-df966c12f3d04b9e98cdcc6421c5f536-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:47 [async_llm.py:261] Added request cmpl-df966c12f3d04b9e98cdcc6421c5f536-0. INFO 03-01 18:56:49 [logger.py:42] Received request cmpl-fda6c19058a14a7e9bc179c28e0ad834-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:49 [async_llm.py:261] Added request cmpl-fda6c19058a14a7e9bc179c28e0ad834-0. INFO 03-01 18:56:50 [logger.py:42] Received request cmpl-b3813ebc905f4f1e8f1ad829670290a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:50 [async_llm.py:261] Added request cmpl-b3813ebc905f4f1e8f1ad829670290a2-0. INFO 03-01 18:56:51 [logger.py:42] Received request cmpl-bf12e42f138d4f708077322ce788c9b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:51 [async_llm.py:261] Added request cmpl-bf12e42f138d4f708077322ce788c9b1-0. INFO 03-01 18:56:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:56:52 [logger.py:42] Received request cmpl-8afa203ccd8248e6a34970cedddb5c47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:52 [async_llm.py:261] Added request cmpl-8afa203ccd8248e6a34970cedddb5c47-0. INFO 03-01 18:56:53 [logger.py:42] Received request cmpl-1566ba65956441a1a808d8a62d58467c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:53 [async_llm.py:261] Added request cmpl-1566ba65956441a1a808d8a62d58467c-0. INFO 03-01 18:56:55 [logger.py:42] Received request cmpl-c202f332ea3f42d5a9cd0d4808ca32c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:55 [async_llm.py:261] Added request cmpl-c202f332ea3f42d5a9cd0d4808ca32c7-0. INFO 03-01 18:56:56 [logger.py:42] Received request cmpl-5a1ddadd4bc54de5b80568ac517b8e7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:56 [async_llm.py:261] Added request cmpl-5a1ddadd4bc54de5b80568ac517b8e7f-0. INFO 03-01 18:56:57 [logger.py:42] Received request cmpl-3ecc318f79b84e22bcc6a5ceee8d6550-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:57 [async_llm.py:261] Added request cmpl-3ecc318f79b84e22bcc6a5ceee8d6550-0. INFO 03-01 18:56:58 [logger.py:42] Received request cmpl-74d7c785762f4ad0ab3716df813e4e02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:58 [async_llm.py:261] Added request cmpl-74d7c785762f4ad0ab3716df813e4e02-0. INFO 03-01 18:56:59 [logger.py:42] Received request cmpl-5473b656d15240f3be1d4c046f11138b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:56:59 [async_llm.py:261] Added request cmpl-5473b656d15240f3be1d4c046f11138b-0. INFO 03-01 18:57:00 [logger.py:42] Received request cmpl-02e1d3be901c4588a6a7acb2e00b05f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:00 [async_llm.py:261] Added request cmpl-02e1d3be901c4588a6a7acb2e00b05f5-0. INFO 03-01 18:57:02 [logger.py:42] Received request cmpl-ba6b7d11bef9428da15a3fd04a9138e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:02 [async_llm.py:261] Added request cmpl-ba6b7d11bef9428da15a3fd04a9138e6-0. INFO 03-01 18:57:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:57:03 [logger.py:42] Received request cmpl-6a7e3a727f974abfb79eb54830a7b373-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:03 [async_llm.py:261] Added request cmpl-6a7e3a727f974abfb79eb54830a7b373-0. INFO 03-01 18:57:04 [logger.py:42] Received request cmpl-f2e16e52f8ba4f4183618004fe7166ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:04 [async_llm.py:261] Added request cmpl-f2e16e52f8ba4f4183618004fe7166ad-0. INFO 03-01 18:57:05 [logger.py:42] Received request cmpl-897b25aaf4034c20baac943fbb08d513-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:05 [async_llm.py:261] Added request cmpl-897b25aaf4034c20baac943fbb08d513-0. INFO 03-01 18:57:06 [logger.py:42] Received request cmpl-9a638ebeb19c4e688ecdaec0d53d2290-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:06 [async_llm.py:261] Added request cmpl-9a638ebeb19c4e688ecdaec0d53d2290-0. INFO 03-01 18:57:07 [logger.py:42] Received request cmpl-d710362a818e43d28a3259cd9838bf82-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:07 [async_llm.py:261] Added request cmpl-d710362a818e43d28a3259cd9838bf82-0. INFO 03-01 18:57:09 [logger.py:42] Received request cmpl-67251367171749ce9dba2e72b8343fb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:09 [async_llm.py:261] Added request cmpl-67251367171749ce9dba2e72b8343fb7-0. INFO 03-01 18:57:10 [logger.py:42] Received request cmpl-9aeed69150d3490181fe768563a4e9b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:10 [async_llm.py:261] Added request cmpl-9aeed69150d3490181fe768563a4e9b0-0. INFO 03-01 18:57:11 [logger.py:42] Received request cmpl-2349c2b0ecfd496dad062abd82621d10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:11 [async_llm.py:261] Added request cmpl-2349c2b0ecfd496dad062abd82621d10-0. INFO 03-01 18:57:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:57:12 [logger.py:42] Received request cmpl-188fbc504b284b0f9fc33c567634223c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:12 [async_llm.py:261] Added request cmpl-188fbc504b284b0f9fc33c567634223c-0. INFO 03-01 18:57:13 [logger.py:42] Received request cmpl-a0b235e065a5481ca3fcf7448c055e15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:13 [async_llm.py:261] Added request cmpl-a0b235e065a5481ca3fcf7448c055e15-0. INFO 03-01 18:57:14 [logger.py:42] Received request cmpl-f0ba94626e19463aac4b86c0a6eccdc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:14 [async_llm.py:261] Added request cmpl-f0ba94626e19463aac4b86c0a6eccdc7-0. INFO 03-01 18:57:16 [logger.py:42] Received request cmpl-70e217786e1c487897fe62610635b5ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:16 [async_llm.py:261] Added request cmpl-70e217786e1c487897fe62610635b5ab-0. INFO 03-01 18:57:17 [logger.py:42] Received request cmpl-3a55c42e285f4910bbb59e71d1b76d7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:17 [async_llm.py:261] Added request cmpl-3a55c42e285f4910bbb59e71d1b76d7d-0. INFO 03-01 18:57:18 [logger.py:42] Received request cmpl-dbd1ae6fc2c14ad78cffcdb2df4e39f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:18 [async_llm.py:261] Added request cmpl-dbd1ae6fc2c14ad78cffcdb2df4e39f9-0. INFO 03-01 18:57:19 [logger.py:42] Received request cmpl-52344db15cee48b5b6d5a5f9605eae1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:19 [async_llm.py:261] Added request cmpl-52344db15cee48b5b6d5a5f9605eae1b-0. INFO 03-01 18:57:20 [logger.py:42] Received request cmpl-cf0b875b0d604c61ae15e95485d43b44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:20 [async_llm.py:261] Added request cmpl-cf0b875b0d604c61ae15e95485d43b44-0. INFO 03-01 18:57:21 [logger.py:42] Received request cmpl-eacb1c3a01bc4926b95f876554761590-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:21 [async_llm.py:261] Added request cmpl-eacb1c3a01bc4926b95f876554761590-0. INFO 03-01 18:57:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:57:22 [logger.py:42] Received request cmpl-f6a6cbe5f15b4aca9d005d4467633c21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:22 [async_llm.py:261] Added request cmpl-f6a6cbe5f15b4aca9d005d4467633c21-0. INFO 03-01 18:57:24 [logger.py:42] Received request cmpl-a7de5ab9c1024fc59334ce5c9ccb899c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:24 [async_llm.py:261] Added request cmpl-a7de5ab9c1024fc59334ce5c9ccb899c-0. INFO 03-01 18:57:25 [logger.py:42] Received request cmpl-34eec7b8a4ca4e57b22eeb4085684a18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:25 [async_llm.py:261] Added request cmpl-34eec7b8a4ca4e57b22eeb4085684a18-0. INFO 03-01 18:57:26 [logger.py:42] Received request cmpl-788b27c822bb4359ac032b60f00298aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:26 [async_llm.py:261] Added request cmpl-788b27c822bb4359ac032b60f00298aa-0. INFO 03-01 18:57:27 [logger.py:42] Received request cmpl-f50f6ecde1c447b290b923486ffa6244-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:27 [async_llm.py:261] Added request cmpl-f50f6ecde1c447b290b923486ffa6244-0. INFO 03-01 18:57:28 [logger.py:42] Received request cmpl-e41ff11d7c684ef9940fe2527eca0d9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:28 [async_llm.py:261] Added request cmpl-e41ff11d7c684ef9940fe2527eca0d9f-0. INFO 03-01 18:57:29 [logger.py:42] Received request cmpl-010eafdb90664e1e905e6879dd03fcf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:29 [async_llm.py:261] Added request cmpl-010eafdb90664e1e905e6879dd03fcf0-0. INFO 03-01 18:57:31 [logger.py:42] Received request cmpl-c8137c8c1d184f3ea5c3ddfb1d212805-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:31 [async_llm.py:261] Added request cmpl-c8137c8c1d184f3ea5c3ddfb1d212805-0. INFO 03-01 18:57:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:57:32 [logger.py:42] Received request cmpl-318516621bed4e50b03ad2ea1f563745-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:32 [async_llm.py:261] Added request cmpl-318516621bed4e50b03ad2ea1f563745-0. INFO 03-01 18:57:33 [logger.py:42] Received request cmpl-428c13c4361547a19b67b153eea79ab7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:33 [async_llm.py:261] Added request cmpl-428c13c4361547a19b67b153eea79ab7-0. INFO 03-01 18:57:34 [logger.py:42] Received request cmpl-472b474e9aa1490598b27037c718dd95-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:34 [async_llm.py:261] Added request cmpl-472b474e9aa1490598b27037c718dd95-0. INFO 03-01 18:57:35 [logger.py:42] Received request cmpl-58e3b1a282d1471785d30bda37f69f14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:35 [async_llm.py:261] Added request cmpl-58e3b1a282d1471785d30bda37f69f14-0. INFO 03-01 18:57:37 [logger.py:42] Received request cmpl-a9503a736af349a1ab9bb0049b22dcbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:37 [async_llm.py:261] Added request cmpl-a9503a736af349a1ab9bb0049b22dcbe-0. INFO 03-01 18:57:38 [logger.py:42] Received request cmpl-ee56ce3771924e8a8f959a4128883ca3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:38 [async_llm.py:261] Added request cmpl-ee56ce3771924e8a8f959a4128883ca3-0. INFO 03-01 18:57:39 [logger.py:42] Received request cmpl-23405b0c4fa84786a8f91de5d59f4135-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:39 [async_llm.py:261] Added request cmpl-23405b0c4fa84786a8f91de5d59f4135-0. INFO 03-01 18:57:40 [logger.py:42] Received request cmpl-8f4d743e36144344b2a25deb880f16d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:40 [async_llm.py:261] Added request cmpl-8f4d743e36144344b2a25deb880f16d4-0. INFO 03-01 18:57:41 [logger.py:42] Received request cmpl-91fbe8153d8146ed9426bf88dc059d35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:41 [async_llm.py:261] Added request cmpl-91fbe8153d8146ed9426bf88dc059d35-0. INFO 03-01 18:57:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:57:42 [logger.py:42] Received request cmpl-a1480e4f71384269b013d7bcfef855b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:42 [async_llm.py:261] Added request cmpl-a1480e4f71384269b013d7bcfef855b1-0. INFO 03-01 18:57:44 [logger.py:42] Received request cmpl-65f0fdd765854f2f9516cff580181b38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:44 [async_llm.py:261] Added request cmpl-65f0fdd765854f2f9516cff580181b38-0. INFO 03-01 18:57:45 [logger.py:42] Received request cmpl-c19ad1c7b6494bef9e2a41fc57702664-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:45 [async_llm.py:261] Added request cmpl-c19ad1c7b6494bef9e2a41fc57702664-0. INFO 03-01 18:57:46 [logger.py:42] Received request cmpl-66046e14cbd34a8a856f90519f1c60ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:46 [async_llm.py:261] Added request cmpl-66046e14cbd34a8a856f90519f1c60ea-0. INFO 03-01 18:57:47 [logger.py:42] Received request cmpl-84205fcf453d4dc69e5e8341ece4a564-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:47 [async_llm.py:261] Added request cmpl-84205fcf453d4dc69e5e8341ece4a564-0. INFO 03-01 18:57:48 [logger.py:42] Received request cmpl-74388db0cac54a98884e5e84256fd7f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:48 [async_llm.py:261] Added request cmpl-74388db0cac54a98884e5e84256fd7f9-0. INFO 03-01 18:57:49 [logger.py:42] Received request cmpl-e095161cb91a424fab749b404403ee6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:49 [async_llm.py:261] Added request cmpl-e095161cb91a424fab749b404403ee6c-0. INFO 03-01 18:57:51 [logger.py:42] Received request cmpl-c8d0f3d528de481896ad63a8d64d87c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:51 [async_llm.py:261] Added request cmpl-c8d0f3d528de481896ad63a8d64d87c0-0. INFO 03-01 18:57:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:57:52 [logger.py:42] Received request cmpl-5275d4076040404fbebd6a0da81841d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:52 [async_llm.py:261] Added request cmpl-5275d4076040404fbebd6a0da81841d1-0. INFO 03-01 18:57:53 [logger.py:42] Received request cmpl-05f9cc88f1fa48279a44c506cd71d343-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:53 [async_llm.py:261] Added request cmpl-05f9cc88f1fa48279a44c506cd71d343-0. INFO 03-01 18:57:54 [logger.py:42] Received request cmpl-7917360f8a434025a61a7e428ea00304-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:54 [async_llm.py:261] Added request cmpl-7917360f8a434025a61a7e428ea00304-0. INFO 03-01 18:57:55 [logger.py:42] Received request cmpl-e70521d64f614a3b8f73837d592b1ea0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:55 [async_llm.py:261] Added request cmpl-e70521d64f614a3b8f73837d592b1ea0-0. INFO 03-01 18:57:57 [logger.py:42] Received request cmpl-3fa2e1e4843c40e2a8d9ebd55282acdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:57 [async_llm.py:261] Added request cmpl-3fa2e1e4843c40e2a8d9ebd55282acdc-0. INFO 03-01 18:57:58 [logger.py:42] Received request cmpl-d920bdae7a5446c6b05475cc4427a09b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:58 [async_llm.py:261] Added request cmpl-d920bdae7a5446c6b05475cc4427a09b-0. INFO 03-01 18:57:59 [logger.py:42] Received request cmpl-717c7c1c410f4006a1dd34056e8d9b1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:57:59 [async_llm.py:261] Added request cmpl-717c7c1c410f4006a1dd34056e8d9b1f-0. INFO 03-01 18:58:00 [logger.py:42] Received request cmpl-3d0a0da1ab0246d9879f67f3811653ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:00 [async_llm.py:261] Added request cmpl-3d0a0da1ab0246d9879f67f3811653ca-0. INFO 03-01 18:58:01 [logger.py:42] Received request cmpl-23283caa3940431bb85257e78428f884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:01 [async_llm.py:261] Added request cmpl-23283caa3940431bb85257e78428f884-0. INFO 03-01 18:58:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:58:02 [logger.py:42] Received request cmpl-3fad7dad65fd4857b7597e4878c1377a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:02 [async_llm.py:261] Added request cmpl-3fad7dad65fd4857b7597e4878c1377a-0. INFO 03-01 18:58:04 [logger.py:42] Received request cmpl-67386bf9f4bf43caa6268db5e64e6bd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:04 [async_llm.py:261] Added request cmpl-67386bf9f4bf43caa6268db5e64e6bd2-0. INFO 03-01 18:58:05 [logger.py:42] Received request cmpl-a63af0e4ba8e4fd78f4c2c4cacdf2be4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:05 [async_llm.py:261] Added request cmpl-a63af0e4ba8e4fd78f4c2c4cacdf2be4-0. INFO 03-01 18:58:06 [logger.py:42] Received request cmpl-8b7efc286d6945479f597a5f2615b965-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:06 [async_llm.py:261] Added request cmpl-8b7efc286d6945479f597a5f2615b965-0. INFO 03-01 18:58:07 [logger.py:42] Received request cmpl-926766d642614efa8a34c1b76b745ed9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:07 [async_llm.py:261] Added request cmpl-926766d642614efa8a34c1b76b745ed9-0. INFO 03-01 18:58:08 [logger.py:42] Received request cmpl-8a880673524a4b29a81975c25d6494e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:08 [async_llm.py:261] Added request cmpl-8a880673524a4b29a81975c25d6494e8-0. INFO 03-01 18:58:09 [logger.py:42] Received request cmpl-7161b6a80f4e44a1b0efd183a740dd15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:09 [async_llm.py:261] Added request cmpl-7161b6a80f4e44a1b0efd183a740dd15-0. INFO 03-01 18:58:11 [logger.py:42] Received request cmpl-2704441ba01242d390608cc9a3cb8480-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:11 [async_llm.py:261] Added request cmpl-2704441ba01242d390608cc9a3cb8480-0. INFO 03-01 18:58:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:58:12 [logger.py:42] Received request cmpl-927b89b6409043579b8c69f1febc5248-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:12 [async_llm.py:261] Added request cmpl-927b89b6409043579b8c69f1febc5248-0. INFO 03-01 18:58:13 [logger.py:42] Received request cmpl-a9f6a3c48aed4eccaac6fe4628a07c9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:13 [async_llm.py:261] Added request cmpl-a9f6a3c48aed4eccaac6fe4628a07c9f-0. INFO 03-01 18:58:14 [logger.py:42] Received request cmpl-990e3cd9a59d46bca2746e149678c457-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:14 [async_llm.py:261] Added request cmpl-990e3cd9a59d46bca2746e149678c457-0. INFO 03-01 18:58:15 [logger.py:42] Received request cmpl-7abd9f88e7604c47b38ad86afe58621f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:15 [async_llm.py:261] Added request cmpl-7abd9f88e7604c47b38ad86afe58621f-0. INFO 03-01 18:58:16 [logger.py:42] Received request cmpl-d2c0a44e377b4fabba9ba6df18b58fc4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:16 [async_llm.py:261] Added request cmpl-d2c0a44e377b4fabba9ba6df18b58fc4-0. INFO 03-01 18:58:18 [logger.py:42] Received request cmpl-4fcdec1ca82e4a26a15a960534d18682-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:18 [async_llm.py:261] Added request cmpl-4fcdec1ca82e4a26a15a960534d18682-0. INFO 03-01 18:58:19 [logger.py:42] Received request cmpl-955a13d99e2141dd9be3b896cb3c2bc9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:19 [async_llm.py:261] Added request cmpl-955a13d99e2141dd9be3b896cb3c2bc9-0. INFO 03-01 18:58:20 [logger.py:42] Received request cmpl-5f1d2d4c217247cba0630ea50817e2dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:20 [async_llm.py:261] Added request cmpl-5f1d2d4c217247cba0630ea50817e2dd-0. INFO 03-01 18:58:21 [logger.py:42] Received request cmpl-a9ccf24bf556459eb36715fbdfff15a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:21 [async_llm.py:261] Added request cmpl-a9ccf24bf556459eb36715fbdfff15a5-0. INFO 03-01 18:58:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:58:22 [logger.py:42] Received request cmpl-dfbda3b6400844748100620bb8c908cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:22 [async_llm.py:261] Added request cmpl-dfbda3b6400844748100620bb8c908cc-0. INFO 03-01 18:58:23 [logger.py:42] Received request cmpl-e372a1a683854764b29bae784f3c9232-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:23 [async_llm.py:261] Added request cmpl-e372a1a683854764b29bae784f3c9232-0. INFO 03-01 18:58:24 [logger.py:42] Received request cmpl-f8437bdc68c74587b51791dd7e1bdb5f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:24 [async_llm.py:261] Added request cmpl-f8437bdc68c74587b51791dd7e1bdb5f-0. INFO 03-01 18:58:26 [logger.py:42] Received request cmpl-fd45276b130b4ea5886f755a82805a78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:26 [async_llm.py:261] Added request cmpl-fd45276b130b4ea5886f755a82805a78-0. INFO 03-01 18:58:27 [logger.py:42] Received request cmpl-9ee7908bd90a42018ce8cd9815a61d84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:27 [async_llm.py:261] Added request cmpl-9ee7908bd90a42018ce8cd9815a61d84-0. INFO 03-01 18:58:28 [logger.py:42] Received request cmpl-34d00e73fda346b38febe88e76b4dd2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:28 [async_llm.py:261] Added request cmpl-34d00e73fda346b38febe88e76b4dd2d-0. INFO 03-01 18:58:29 [logger.py:42] Received request cmpl-d3ffbd65884e440fa00651db0263792b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:29 [async_llm.py:261] Added request cmpl-d3ffbd65884e440fa00651db0263792b-0. INFO 03-01 18:58:30 [logger.py:42] Received request cmpl-18f93d539a32433f8e1953a63ef7e5db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:30 [async_llm.py:261] Added request cmpl-18f93d539a32433f8e1953a63ef7e5db-0. INFO 03-01 18:58:31 [logger.py:42] Received request cmpl-fd72ae0e97ba41258fb921a442375a96-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:31 [async_llm.py:261] Added request cmpl-fd72ae0e97ba41258fb921a442375a96-0. INFO 03-01 18:58:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:58:33 [logger.py:42] Received request cmpl-4c9b7d83926e404895955e0503949d34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:33 [async_llm.py:261] Added request cmpl-4c9b7d83926e404895955e0503949d34-0. INFO 03-01 18:58:34 [logger.py:42] Received request cmpl-d51ae15bf6fa4381ba42954f5e42c2d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:34 [async_llm.py:261] Added request cmpl-d51ae15bf6fa4381ba42954f5e42c2d6-0. INFO 03-01 18:58:35 [logger.py:42] Received request cmpl-5a3f44d9a8b74524b37c0ddaf8d7708d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:35 [async_llm.py:261] Added request cmpl-5a3f44d9a8b74524b37c0ddaf8d7708d-0. INFO 03-01 18:58:36 [logger.py:42] Received request cmpl-aa34b021ac834cc19676591be77f1086-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:36 [async_llm.py:261] Added request cmpl-aa34b021ac834cc19676591be77f1086-0. INFO 03-01 18:58:37 [logger.py:42] Received request cmpl-72bd84708f1345be85b807ed184cc6ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:37 [async_llm.py:261] Added request cmpl-72bd84708f1345be85b807ed184cc6ec-0. INFO 03-01 18:58:38 [logger.py:42] Received request cmpl-fc76d06b20274eeb909a15a2841c8863-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:38 [async_llm.py:261] Added request cmpl-fc76d06b20274eeb909a15a2841c8863-0. INFO 03-01 18:58:40 [logger.py:42] Received request cmpl-cbfbd9476af7415a9dc741ae7870c115-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:40 [async_llm.py:261] Added request cmpl-cbfbd9476af7415a9dc741ae7870c115-0. INFO 03-01 18:58:41 [logger.py:42] Received request cmpl-9005a155194d4bafa588f745a4f02dfc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:41 [async_llm.py:261] Added request cmpl-9005a155194d4bafa588f745a4f02dfc-0. INFO 03-01 18:58:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:58:42 [logger.py:42] Received request cmpl-2b009caa6f7d4c3bb02d78d4c980f554-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:42 [async_llm.py:261] Added request cmpl-2b009caa6f7d4c3bb02d78d4c980f554-0. INFO 03-01 18:58:43 [logger.py:42] Received request cmpl-4768f9aa83574e3698f255a4e56c59a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:43 [async_llm.py:261] Added request cmpl-4768f9aa83574e3698f255a4e56c59a9-0. INFO 03-01 18:58:44 [logger.py:42] Received request cmpl-1e81c250dce14a859524365cdeef30f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:44 [async_llm.py:261] Added request cmpl-1e81c250dce14a859524365cdeef30f7-0. INFO 03-01 18:58:45 [logger.py:42] Received request cmpl-9e21917209cc434fbd3b17588a6e2ad2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:45 [async_llm.py:261] Added request cmpl-9e21917209cc434fbd3b17588a6e2ad2-0. INFO 03-01 18:58:47 [logger.py:42] Received request cmpl-7a6b8efdc325408f848b3709ec37c043-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:47 [async_llm.py:261] Added request cmpl-7a6b8efdc325408f848b3709ec37c043-0. INFO 03-01 18:58:48 [logger.py:42] Received request cmpl-fd22d77a90724651925c1e24889956cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:48 [async_llm.py:261] Added request cmpl-fd22d77a90724651925c1e24889956cc-0. INFO 03-01 18:58:49 [logger.py:42] Received request cmpl-8c694cf0d57849838fd3965db378d05a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:49 [async_llm.py:261] Added request cmpl-8c694cf0d57849838fd3965db378d05a-0. INFO 03-01 18:58:50 [logger.py:42] Received request cmpl-10d1f234bb8c4187a2c350023a035814-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:50 [async_llm.py:261] Added request cmpl-10d1f234bb8c4187a2c350023a035814-0. INFO 03-01 18:58:51 [logger.py:42] Received request cmpl-d62fd1c4b9b6420bb386a747f280d149-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:51 [async_llm.py:261] Added request cmpl-d62fd1c4b9b6420bb386a747f280d149-0. INFO 03-01 18:58:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:58:52 [logger.py:42] Received request cmpl-6ba2e1b50d494a469335aca6c20557c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:52 [async_llm.py:261] Added request cmpl-6ba2e1b50d494a469335aca6c20557c9-0. INFO 03-01 18:58:54 [logger.py:42] Received request cmpl-970a6e29115542b4b9d137615a7f6568-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:54 [async_llm.py:261] Added request cmpl-970a6e29115542b4b9d137615a7f6568-0. INFO 03-01 18:58:55 [logger.py:42] Received request cmpl-c987e2d261af4e559b85a907b9c7eb42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:55 [async_llm.py:261] Added request cmpl-c987e2d261af4e559b85a907b9c7eb42-0. INFO 03-01 18:58:56 [logger.py:42] Received request cmpl-4185e419472f4df29ab7ebfaa6b2ad36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:56 [async_llm.py:261] Added request cmpl-4185e419472f4df29ab7ebfaa6b2ad36-0. INFO 03-01 18:58:57 [logger.py:42] Received request cmpl-b37745a3010b4250a32ea2b7689c872e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:57 [async_llm.py:261] Added request cmpl-b37745a3010b4250a32ea2b7689c872e-0. INFO 03-01 18:58:58 [logger.py:42] Received request cmpl-3be9c7aa3538444d8d9899539d22a0c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:58 [async_llm.py:261] Added request cmpl-3be9c7aa3538444d8d9899539d22a0c9-0. INFO 03-01 18:58:59 [logger.py:42] Received request cmpl-a43704eebaf84288933e503ed4ec8ca8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:58:59 [async_llm.py:261] Added request cmpl-a43704eebaf84288933e503ed4ec8ca8-0. INFO 03-01 18:59:00 [logger.py:42] Received request cmpl-9ce0864ac75449a18e7389f144ee55c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:00 [async_llm.py:261] Added request cmpl-9ce0864ac75449a18e7389f144ee55c3-0. INFO 03-01 18:59:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:59:02 [logger.py:42] Received request cmpl-afba5e094a484c90bb082aff7685f6db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:02 [async_llm.py:261] Added request cmpl-afba5e094a484c90bb082aff7685f6db-0. INFO 03-01 18:59:03 [logger.py:42] Received request cmpl-7dcb161bd89b43a59dd5205fb3fa5c0d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:03 [async_llm.py:261] Added request cmpl-7dcb161bd89b43a59dd5205fb3fa5c0d-0. INFO 03-01 18:59:04 [logger.py:42] Received request cmpl-ad22c92dcbe14372acfc1c7bfce107dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:04 [async_llm.py:261] Added request cmpl-ad22c92dcbe14372acfc1c7bfce107dd-0. INFO 03-01 18:59:05 [logger.py:42] Received request cmpl-868ec37d79004fbeb35c4dc1ad05c7f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:05 [async_llm.py:261] Added request cmpl-868ec37d79004fbeb35c4dc1ad05c7f6-0. INFO 03-01 18:59:06 [logger.py:42] Received request cmpl-af6454bf724f4a83852f32ad407c5f58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:06 [async_llm.py:261] Added request cmpl-af6454bf724f4a83852f32ad407c5f58-0. INFO 03-01 18:59:07 [logger.py:42] Received request cmpl-1e843a6a75cc4479b35c2f6f10d9e39d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:07 [async_llm.py:261] Added request cmpl-1e843a6a75cc4479b35c2f6f10d9e39d-0. INFO 03-01 18:59:09 [logger.py:42] Received request cmpl-a859cf8b4ca74811b9276b5638d1c023-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:09 [async_llm.py:261] Added request cmpl-a859cf8b4ca74811b9276b5638d1c023-0. INFO 03-01 18:59:10 [logger.py:42] Received request cmpl-53de812d9c84405aab05b6cedf5c18fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:10 [async_llm.py:261] Added request cmpl-53de812d9c84405aab05b6cedf5c18fd-0. INFO 03-01 18:59:11 [logger.py:42] Received request cmpl-a36662c6aed8496db7d0ad9615d8b97a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:11 [async_llm.py:261] Added request cmpl-a36662c6aed8496db7d0ad9615d8b97a-0. INFO 03-01 18:59:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:59:12 [logger.py:42] Received request cmpl-496d174fb400437e970154b70c39de9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:12 [async_llm.py:261] Added request cmpl-496d174fb400437e970154b70c39de9b-0. INFO 03-01 18:59:13 [logger.py:42] Received request cmpl-2b3860b37e1a415c8b0b6f552573109c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:13 [async_llm.py:261] Added request cmpl-2b3860b37e1a415c8b0b6f552573109c-0. INFO 03-01 18:59:14 [logger.py:42] Received request cmpl-d19647d31b11489cac9b01348ca6d19e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:14 [async_llm.py:261] Added request cmpl-d19647d31b11489cac9b01348ca6d19e-0. INFO 03-01 18:59:16 [logger.py:42] Received request cmpl-afa0278e1b734df9bfe033cbfaa94942-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:16 [async_llm.py:261] Added request cmpl-afa0278e1b734df9bfe033cbfaa94942-0. INFO 03-01 18:59:17 [logger.py:42] Received request cmpl-2c840f30c7d4492fa7d1c4cdfb41deba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:17 [async_llm.py:261] Added request cmpl-2c840f30c7d4492fa7d1c4cdfb41deba-0. INFO 03-01 18:59:18 [logger.py:42] Received request cmpl-66892c7a6eff4d5a84c410d02e218eba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:18 [async_llm.py:261] Added request cmpl-66892c7a6eff4d5a84c410d02e218eba-0. INFO 03-01 18:59:19 [logger.py:42] Received request cmpl-96808276d50846baaa83ad375264611c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:19 [async_llm.py:261] Added request cmpl-96808276d50846baaa83ad375264611c-0. INFO 03-01 18:59:20 [logger.py:42] Received request cmpl-46f220494f31450ebde138aeb5fae717-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:20 [async_llm.py:261] Added request cmpl-46f220494f31450ebde138aeb5fae717-0. INFO 03-01 18:59:21 [logger.py:42] Received request cmpl-4b6e601c662749e88d3ca22684b280ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:21 [async_llm.py:261] Added request cmpl-4b6e601c662749e88d3ca22684b280ac-0. INFO 03-01 18:59:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:59:23 [logger.py:42] Received request cmpl-119606c81dcd4ed39bcfd5a6f21b1549-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:23 [async_llm.py:261] Added request cmpl-119606c81dcd4ed39bcfd5a6f21b1549-0. INFO 03-01 18:59:24 [logger.py:42] Received request cmpl-b0382c2c1bba43a88a532c0baa5f7dca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:24 [async_llm.py:261] Added request cmpl-b0382c2c1bba43a88a532c0baa5f7dca-0. INFO 03-01 18:59:25 [logger.py:42] Received request cmpl-3dd3418018484daeab6cc3c5030fa524-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:25 [async_llm.py:261] Added request cmpl-3dd3418018484daeab6cc3c5030fa524-0. INFO 03-01 18:59:26 [logger.py:42] Received request cmpl-1f9f5a7a1a3c4705857fe28dc0967000-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:26 [async_llm.py:261] Added request cmpl-1f9f5a7a1a3c4705857fe28dc0967000-0. INFO 03-01 18:59:27 [logger.py:42] Received request cmpl-dd47197c57ef4512aca36cc435e39753-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:27 [async_llm.py:261] Added request cmpl-dd47197c57ef4512aca36cc435e39753-0. INFO 03-01 18:59:28 [logger.py:42] Received request cmpl-b557895ebe88498f879aacbd6de928fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:28 [async_llm.py:261] Added request cmpl-b557895ebe88498f879aacbd6de928fe-0. INFO 03-01 18:59:29 [logger.py:42] Received request cmpl-0777f4a2e1f547bda95873011c01c367-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:29 [async_llm.py:261] Added request cmpl-0777f4a2e1f547bda95873011c01c367-0. INFO 03-01 18:59:31 [logger.py:42] Received request cmpl-76502d0a911a4e0793d2bcdcbc0e4035-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:31 [async_llm.py:261] Added request cmpl-76502d0a911a4e0793d2bcdcbc0e4035-0. INFO 03-01 18:59:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:59:32 [logger.py:42] Received request cmpl-1f90cf77a4d8492d995ddc8e04e1bf2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:32 [async_llm.py:261] Added request cmpl-1f90cf77a4d8492d995ddc8e04e1bf2f-0. INFO 03-01 18:59:33 [logger.py:42] Received request cmpl-54c3152861594a67a9b87c4c86a62b25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:33 [async_llm.py:261] Added request cmpl-54c3152861594a67a9b87c4c86a62b25-0. INFO 03-01 18:59:34 [logger.py:42] Received request cmpl-bd582be4e7c940eda5b7b5cd946c13ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:34 [async_llm.py:261] Added request cmpl-bd582be4e7c940eda5b7b5cd946c13ed-0. INFO 03-01 18:59:35 [logger.py:42] Received request cmpl-18eb0b3b983b40d2925fc134878274b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:35 [async_llm.py:261] Added request cmpl-18eb0b3b983b40d2925fc134878274b0-0. INFO 03-01 18:59:36 [logger.py:42] Received request cmpl-90273a499fef45b4892abe9b8660345d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:36 [async_llm.py:261] Added request cmpl-90273a499fef45b4892abe9b8660345d-0. INFO 03-01 18:59:38 [logger.py:42] Received request cmpl-615dbf382ca04c4fb8d66da63b6a607d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:38 [async_llm.py:261] Added request cmpl-615dbf382ca04c4fb8d66da63b6a607d-0. INFO 03-01 18:59:39 [logger.py:42] Received request cmpl-c1c5422b5804457caa6513edcb5840c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:39 [async_llm.py:261] Added request cmpl-c1c5422b5804457caa6513edcb5840c4-0. INFO 03-01 18:59:40 [logger.py:42] Received request cmpl-0a21984c56f44c70b190e3ab17630a34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:40 [async_llm.py:261] Added request cmpl-0a21984c56f44c70b190e3ab17630a34-0. INFO 03-01 18:59:41 [logger.py:42] Received request cmpl-7ffe09d59e344583bd34301d0fba391a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:41 [async_llm.py:261] Added request cmpl-7ffe09d59e344583bd34301d0fba391a-0. INFO 03-01 18:59:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 18:59:42 [logger.py:42] Received request cmpl-390dbf4200c24b339d1444b5ad33537a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:42 [async_llm.py:261] Added request cmpl-390dbf4200c24b339d1444b5ad33537a-0. INFO 03-01 18:59:43 [logger.py:42] Received request cmpl-140013d0d0a64b21b7331e847f51807a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:43 [async_llm.py:261] Added request cmpl-140013d0d0a64b21b7331e847f51807a-0. INFO 03-01 18:59:45 [logger.py:42] Received request cmpl-08659d892cd24966b0b00945af964f11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:45 [async_llm.py:261] Added request cmpl-08659d892cd24966b0b00945af964f11-0. INFO 03-01 18:59:46 [logger.py:42] Received request cmpl-a6d17f1a73b44036b6666d679b80a1bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:46 [async_llm.py:261] Added request cmpl-a6d17f1a73b44036b6666d679b80a1bf-0. INFO 03-01 18:59:47 [logger.py:42] Received request cmpl-59cf40722e274da1b05372d0c79d59b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:47 [async_llm.py:261] Added request cmpl-59cf40722e274da1b05372d0c79d59b3-0. INFO 03-01 18:59:48 [logger.py:42] Received request cmpl-a22174b933694e24b30e2977f79e0a49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:48 [async_llm.py:261] Added request cmpl-a22174b933694e24b30e2977f79e0a49-0. INFO 03-01 18:59:49 [logger.py:42] Received request cmpl-bbf6ba861b824f8192b434ca2c2e565b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:49 [async_llm.py:261] Added request cmpl-bbf6ba861b824f8192b434ca2c2e565b-0. INFO 03-01 18:59:50 [logger.py:42] Received request cmpl-517fae973a63446aae0625f9752a89ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:50 [async_llm.py:261] Added request cmpl-517fae973a63446aae0625f9752a89ac-0. INFO 03-01 18:59:52 [logger.py:42] Received request cmpl-a1762d071f194e74a17bd89100c968b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:52 [async_llm.py:261] Added request cmpl-a1762d071f194e74a17bd89100c968b9-0. INFO 03-01 18:59:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 18:59:53 [logger.py:42] Received request cmpl-a96bfe4dd85a44b8b65ffa36c29acdbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:53 [async_llm.py:261] Added request cmpl-a96bfe4dd85a44b8b65ffa36c29acdbe-0. INFO 03-01 18:59:54 [logger.py:42] Received request cmpl-c5bb6171debe4e7499a5ff092f176be0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:54 [async_llm.py:261] Added request cmpl-c5bb6171debe4e7499a5ff092f176be0-0. INFO 03-01 18:59:55 [logger.py:42] Received request cmpl-a5cb601ab7a24619b96ce7cf3f5be175-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:55 [async_llm.py:261] Added request cmpl-a5cb601ab7a24619b96ce7cf3f5be175-0. INFO 03-01 18:59:56 [logger.py:42] Received request cmpl-5623fa2ad8b648908eefa7e62ca847d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:56 [async_llm.py:261] Added request cmpl-5623fa2ad8b648908eefa7e62ca847d7-0. INFO 03-01 18:59:57 [logger.py:42] Received request cmpl-f8e6977d723e4110b2896b2627fe6d8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:57 [async_llm.py:261] Added request cmpl-f8e6977d723e4110b2896b2627fe6d8f-0. INFO 03-01 18:59:58 [logger.py:42] Received request cmpl-9069e7bacd0743909e25fb427f4adda6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 18:59:58 [async_llm.py:261] Added request cmpl-9069e7bacd0743909e25fb427f4adda6-0. INFO 03-01 19:00:00 [logger.py:42] Received request cmpl-847198602cc848d6bcdd7a8ef5cb8fc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:00 [async_llm.py:261] Added request cmpl-847198602cc848d6bcdd7a8ef5cb8fc3-0. INFO 03-01 19:00:01 [logger.py:42] Received request cmpl-d52a9d2be2634716892d9e207dd53b55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:01 [async_llm.py:261] Added request cmpl-d52a9d2be2634716892d9e207dd53b55-0. INFO 03-01 19:00:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:00:02 [logger.py:42] Received request cmpl-b1c07af5310746a780baddb08bf44a7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:02 [async_llm.py:261] Added request cmpl-b1c07af5310746a780baddb08bf44a7d-0. INFO 03-01 19:00:03 [logger.py:42] Received request cmpl-4d22ed0badfe4614baf0eb9f9ae14598-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:03 [async_llm.py:261] Added request cmpl-4d22ed0badfe4614baf0eb9f9ae14598-0. INFO 03-01 19:00:04 [logger.py:42] Received request cmpl-d784187778c64a3b8eb2dec0a731044d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:04 [async_llm.py:261] Added request cmpl-d784187778c64a3b8eb2dec0a731044d-0. INFO 03-01 19:00:05 [logger.py:42] Received request cmpl-11800253cd2343c08289419056042915-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:05 [async_llm.py:261] Added request cmpl-11800253cd2343c08289419056042915-0. INFO 03-01 19:00:07 [logger.py:42] Received request cmpl-708d6c72744d4790942e49ac72ddd26b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:07 [async_llm.py:261] Added request cmpl-708d6c72744d4790942e49ac72ddd26b-0. INFO 03-01 19:00:08 [logger.py:42] Received request cmpl-c17759a1e28c412eafbd393e887d74ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:08 [async_llm.py:261] Added request cmpl-c17759a1e28c412eafbd393e887d74ec-0. INFO 03-01 19:00:09 [logger.py:42] Received request cmpl-12333f574156493caf804ff0d775f93b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:09 [async_llm.py:261] Added request cmpl-12333f574156493caf804ff0d775f93b-0. INFO 03-01 19:00:10 [logger.py:42] Received request cmpl-e3c1f0cf585246d58a951bb04d17fb01-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:10 [async_llm.py:261] Added request cmpl-e3c1f0cf585246d58a951bb04d17fb01-0. INFO 03-01 19:00:11 [logger.py:42] Received request cmpl-5878a61c23714314b5c1e0f7d82b909b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:11 [async_llm.py:261] Added request cmpl-5878a61c23714314b5c1e0f7d82b909b-0. INFO 03-01 19:00:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:00:12 [logger.py:42] Received request cmpl-b3725a4bd1f64091919b09ea105174c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:12 [async_llm.py:261] Added request cmpl-b3725a4bd1f64091919b09ea105174c5-0. INFO 03-01 19:00:14 [logger.py:42] Received request cmpl-e682c7ec335e4341b4dba75bd39fee4b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:14 [async_llm.py:261] Added request cmpl-e682c7ec335e4341b4dba75bd39fee4b-0. INFO 03-01 19:00:15 [logger.py:42] Received request cmpl-b5a077b16ddb433f83e575a393a31232-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:15 [async_llm.py:261] Added request cmpl-b5a077b16ddb433f83e575a393a31232-0. INFO 03-01 19:00:16 [logger.py:42] Received request cmpl-d101d3aab3ce4c9dbd134374500ba4c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:16 [async_llm.py:261] Added request cmpl-d101d3aab3ce4c9dbd134374500ba4c2-0. INFO 03-01 19:00:17 [logger.py:42] Received request cmpl-a9b3f85ce8d14b97a29b0b777543e560-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:17 [async_llm.py:261] Added request cmpl-a9b3f85ce8d14b97a29b0b777543e560-0. INFO 03-01 19:00:18 [logger.py:42] Received request cmpl-c66c3a61b8c94d7aa1cb194c6a8efaf9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:18 [async_llm.py:261] Added request cmpl-c66c3a61b8c94d7aa1cb194c6a8efaf9-0. INFO 03-01 19:00:19 [logger.py:42] Received request cmpl-405a0ee190474a3c8be7e96af3807a6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:19 [async_llm.py:261] Added request cmpl-405a0ee190474a3c8be7e96af3807a6b-0. INFO 03-01 19:00:20 [logger.py:42] Received request cmpl-d96e496879e44086a7f5601b0947854a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:20 [async_llm.py:261] Added request cmpl-d96e496879e44086a7f5601b0947854a-0. INFO 03-01 19:00:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:00:22 [logger.py:42] Received request cmpl-bb1709aa78fb49cfa64f8bdee6fe6174-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:22 [async_llm.py:261] Added request cmpl-bb1709aa78fb49cfa64f8bdee6fe6174-0. INFO 03-01 19:00:23 [logger.py:42] Received request cmpl-5df7e24c8ba34a38934a579243291624-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:23 [async_llm.py:261] Added request cmpl-5df7e24c8ba34a38934a579243291624-0. INFO 03-01 19:00:24 [logger.py:42] Received request cmpl-348796a2e2c349b79374d6be62031721-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:24 [async_llm.py:261] Added request cmpl-348796a2e2c349b79374d6be62031721-0. INFO 03-01 19:00:25 [logger.py:42] Received request cmpl-35fe9db38791443a9030c1ea36ffad31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:25 [async_llm.py:261] Added request cmpl-35fe9db38791443a9030c1ea36ffad31-0. INFO 03-01 19:00:26 [logger.py:42] Received request cmpl-0d4bed93c745428bb2436c6f5f6da337-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:26 [async_llm.py:261] Added request cmpl-0d4bed93c745428bb2436c6f5f6da337-0. INFO 03-01 19:00:27 [logger.py:42] Received request cmpl-4274944124e44c2c88f967ca59f261f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:27 [async_llm.py:261] Added request cmpl-4274944124e44c2c88f967ca59f261f4-0. INFO 03-01 19:00:29 [logger.py:42] Received request cmpl-4f6a1576a11344d393bf8b4faf8e9c33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:29 [async_llm.py:261] Added request cmpl-4f6a1576a11344d393bf8b4faf8e9c33-0. INFO 03-01 19:00:30 [logger.py:42] Received request cmpl-74780b0c40134975a8279b0bee0df49a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:30 [async_llm.py:261] Added request cmpl-74780b0c40134975a8279b0bee0df49a-0. INFO 03-01 19:00:31 [logger.py:42] Received request cmpl-45844decd7204e7692d94c7932a0f3f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:31 [async_llm.py:261] Added request cmpl-45844decd7204e7692d94c7932a0f3f5-0. INFO 03-01 19:00:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:00:32 [logger.py:42] Received request cmpl-73dab2dba2de490e88a15c7bc983e2ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:32 [async_llm.py:261] Added request cmpl-73dab2dba2de490e88a15c7bc983e2ed-0. INFO 03-01 19:00:33 [logger.py:42] Received request cmpl-520641a8810445eda3b3a2d637e7f30e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:33 [async_llm.py:261] Added request cmpl-520641a8810445eda3b3a2d637e7f30e-0. INFO 03-01 19:00:34 [logger.py:42] Received request cmpl-eaec59d8bc2947f6af6f647628eff6b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:34 [async_llm.py:261] Added request cmpl-eaec59d8bc2947f6af6f647628eff6b4-0. INFO 03-01 19:00:36 [logger.py:42] Received request cmpl-30eddcca8f344367bed163b33f6a5ac6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:36 [async_llm.py:261] Added request cmpl-30eddcca8f344367bed163b33f6a5ac6-0. INFO 03-01 19:00:37 [logger.py:42] Received request cmpl-a038107dcb7e41a3b15cb4c092869147-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:37 [async_llm.py:261] Added request cmpl-a038107dcb7e41a3b15cb4c092869147-0. INFO 03-01 19:00:38 [logger.py:42] Received request cmpl-071985a188c740cd884923c914c3f20a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:38 [async_llm.py:261] Added request cmpl-071985a188c740cd884923c914c3f20a-0. INFO 03-01 19:00:39 [logger.py:42] Received request cmpl-b570d1990c5e4cfdab77358f34659014-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:39 [async_llm.py:261] Added request cmpl-b570d1990c5e4cfdab77358f34659014-0. INFO 03-01 19:00:40 [logger.py:42] Received request cmpl-3d38057acc244db8986e1dedfedb44c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:40 [async_llm.py:261] Added request cmpl-3d38057acc244db8986e1dedfedb44c1-0. INFO 03-01 19:00:41 [logger.py:42] Received request cmpl-2c7c71c464ab494186894b110072234d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:41 [async_llm.py:261] Added request cmpl-2c7c71c464ab494186894b110072234d-0. INFO 03-01 19:00:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:00:43 [logger.py:42] Received request cmpl-e43e8f016e3b47ac95752e40a7354927-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:43 [async_llm.py:261] Added request cmpl-e43e8f016e3b47ac95752e40a7354927-0. INFO 03-01 19:00:44 [logger.py:42] Received request cmpl-bc597d44a6d3490fb6d9fb3716749aab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:44 [async_llm.py:261] Added request cmpl-bc597d44a6d3490fb6d9fb3716749aab-0. INFO 03-01 19:00:45 [logger.py:42] Received request cmpl-0eef2c0581df488a9cd03d359115435c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:45 [async_llm.py:261] Added request cmpl-0eef2c0581df488a9cd03d359115435c-0. INFO 03-01 19:00:46 [logger.py:42] Received request cmpl-77622d0189434403af5f3605c605b3fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:46 [async_llm.py:261] Added request cmpl-77622d0189434403af5f3605c605b3fd-0. INFO 03-01 19:00:47 [logger.py:42] Received request cmpl-4b617b89604f4239920d1f69def503b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:47 [async_llm.py:261] Added request cmpl-4b617b89604f4239920d1f69def503b1-0. INFO 03-01 19:00:48 [logger.py:42] Received request cmpl-154d3c67771b4a03aab0a3f4f1ae0540-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:48 [async_llm.py:261] Added request cmpl-154d3c67771b4a03aab0a3f4f1ae0540-0. INFO 03-01 19:00:49 [logger.py:42] Received request cmpl-f82b7065f46348a9a01e5a9d81355ecb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:49 [async_llm.py:261] Added request cmpl-f82b7065f46348a9a01e5a9d81355ecb-0. INFO 03-01 19:00:51 [logger.py:42] Received request cmpl-438263626b2544fca0fdd272a748ba65-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:51 [async_llm.py:261] Added request cmpl-438263626b2544fca0fdd272a748ba65-0. INFO 03-01 19:00:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:00:52 [logger.py:42] Received request cmpl-269e2df0fa3249cea9d34d0260706e80-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:52 [async_llm.py:261] Added request cmpl-269e2df0fa3249cea9d34d0260706e80-0. INFO 03-01 19:00:53 [logger.py:42] Received request cmpl-3d98acccab204c32aec37dd82efd245a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:53 [async_llm.py:261] Added request cmpl-3d98acccab204c32aec37dd82efd245a-0. INFO 03-01 19:00:54 [logger.py:42] Received request cmpl-7c88a9dd69dc47159a0fdf2e7ef06e6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:54 [async_llm.py:261] Added request cmpl-7c88a9dd69dc47159a0fdf2e7ef06e6e-0. INFO 03-01 19:00:55 [logger.py:42] Received request cmpl-226a375e7f8a42089f42f4003111b53e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:55 [async_llm.py:261] Added request cmpl-226a375e7f8a42089f42f4003111b53e-0. INFO 03-01 19:00:57 [logger.py:42] Received request cmpl-e20028c8ee0748fa95a4075099e05435-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:57 [async_llm.py:261] Added request cmpl-e20028c8ee0748fa95a4075099e05435-0. INFO 03-01 19:00:58 [logger.py:42] Received request cmpl-c6ba8d6307b747f5baf13628a6e068d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:58 [async_llm.py:261] Added request cmpl-c6ba8d6307b747f5baf13628a6e068d5-0. INFO 03-01 19:00:59 [logger.py:42] Received request cmpl-930901ed5faa450ca9749201c53daddf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:00:59 [async_llm.py:261] Added request cmpl-930901ed5faa450ca9749201c53daddf-0. INFO 03-01 19:01:00 [logger.py:42] Received request cmpl-f703d325edb14523973e6c55c499e23c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:00 [async_llm.py:261] Added request cmpl-f703d325edb14523973e6c55c499e23c-0. INFO 03-01 19:01:01 [logger.py:42] Received request cmpl-e8da48ce2ca2481bb4c67e0e14d3262b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:01 [async_llm.py:261] Added request cmpl-e8da48ce2ca2481bb4c67e0e14d3262b-0. INFO 03-01 19:01:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:01:02 [logger.py:42] Received request cmpl-50ace396c65e4eab960a8931f93fe875-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:02 [async_llm.py:261] Added request cmpl-50ace396c65e4eab960a8931f93fe875-0. INFO 03-01 19:01:04 [logger.py:42] Received request cmpl-a254f891e1fa408eb817aaa95d462b39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:04 [async_llm.py:261] Added request cmpl-a254f891e1fa408eb817aaa95d462b39-0. INFO 03-01 19:01:05 [logger.py:42] Received request cmpl-e150be6190124d159eb6c4d173eead2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:05 [async_llm.py:261] Added request cmpl-e150be6190124d159eb6c4d173eead2f-0. INFO 03-01 19:01:06 [logger.py:42] Received request cmpl-a7ee0f53bbd44ed0bcbc1760c7e20982-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:06 [async_llm.py:261] Added request cmpl-a7ee0f53bbd44ed0bcbc1760c7e20982-0. INFO 03-01 19:01:07 [logger.py:42] Received request cmpl-fa9fa76d37bf41eeb30aca5f13384f33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:07 [async_llm.py:261] Added request cmpl-fa9fa76d37bf41eeb30aca5f13384f33-0. INFO 03-01 19:01:08 [logger.py:42] Received request cmpl-95319e07d0fe4529ad82fb4eea10f9ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:08 [async_llm.py:261] Added request cmpl-95319e07d0fe4529ad82fb4eea10f9ee-0. INFO 03-01 19:01:10 [logger.py:42] Received request cmpl-09f7433cde6b4610a745610248798c62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:10 [async_llm.py:261] Added request cmpl-09f7433cde6b4610a745610248798c62-0. INFO 03-01 19:01:11 [logger.py:42] Received request cmpl-8f4b744ce8f14ef482e78e91802abb55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:11 [async_llm.py:261] Added request cmpl-8f4b744ce8f14ef482e78e91802abb55-0. INFO 03-01 19:01:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:01:12 [logger.py:42] Received request cmpl-4feeef68437f4ba589938c4fee166bb1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:12 [async_llm.py:261] Added request cmpl-4feeef68437f4ba589938c4fee166bb1-0. INFO 03-01 19:01:13 [logger.py:42] Received request cmpl-bd7e21fab9824401b6ca2dffacd642b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:13 [async_llm.py:261] Added request cmpl-bd7e21fab9824401b6ca2dffacd642b7-0. INFO 03-01 19:01:14 [logger.py:42] Received request cmpl-f678590968bf44288accec19495356a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:14 [async_llm.py:261] Added request cmpl-f678590968bf44288accec19495356a0-0. INFO 03-01 19:01:15 [logger.py:42] Received request cmpl-6b5724e582ed42d392a1e47beaa6a8d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:15 [async_llm.py:261] Added request cmpl-6b5724e582ed42d392a1e47beaa6a8d5-0. INFO 03-01 19:01:17 [logger.py:42] Received request cmpl-63e813a2d3a1446eaedadb35f24949e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:17 [async_llm.py:261] Added request cmpl-63e813a2d3a1446eaedadb35f24949e4-0. INFO 03-01 19:01:18 [logger.py:42] Received request cmpl-097129ed3bd74707aabacf29a76c0471-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:18 [async_llm.py:261] Added request cmpl-097129ed3bd74707aabacf29a76c0471-0. INFO 03-01 19:01:19 [logger.py:42] Received request cmpl-c631aa96df384eaf848473470051d30f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:19 [async_llm.py:261] Added request cmpl-c631aa96df384eaf848473470051d30f-0. INFO 03-01 19:01:20 [logger.py:42] Received request cmpl-6ce16d8839ef4288b8de07ff4b428a7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:20 [async_llm.py:261] Added request cmpl-6ce16d8839ef4288b8de07ff4b428a7f-0. INFO 03-01 19:01:21 [logger.py:42] Received request cmpl-ecc1838427ef4a5f87448abd9edc2f0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:21 [async_llm.py:261] Added request cmpl-ecc1838427ef4a5f87448abd9edc2f0b-0. INFO 03-01 19:01:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:01:22 [logger.py:42] Received request cmpl-7ce17e7d2b0242f3a4f7eab3f4a56147-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:22 [async_llm.py:261] Added request cmpl-7ce17e7d2b0242f3a4f7eab3f4a56147-0. INFO 03-01 19:01:24 [logger.py:42] Received request cmpl-2e69d63ac1ca49aba38071e1092a24d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:24 [async_llm.py:261] Added request cmpl-2e69d63ac1ca49aba38071e1092a24d0-0. INFO 03-01 19:01:25 [logger.py:42] Received request cmpl-113c571b0e114045ae2783aab88e8dc8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:25 [async_llm.py:261] Added request cmpl-113c571b0e114045ae2783aab88e8dc8-0. INFO 03-01 19:01:26 [logger.py:42] Received request cmpl-aee35cd2a67a44fbb5b5f32c8b2c528e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:26 [async_llm.py:261] Added request cmpl-aee35cd2a67a44fbb5b5f32c8b2c528e-0. INFO 03-01 19:01:27 [logger.py:42] Received request cmpl-3a598f10b5414c27849ffad46574f8b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:27 [async_llm.py:261] Added request cmpl-3a598f10b5414c27849ffad46574f8b8-0. INFO 03-01 19:01:28 [logger.py:42] Received request cmpl-eaa9f5a203654c0d83b16fc736aff8fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:28 [async_llm.py:261] Added request cmpl-eaa9f5a203654c0d83b16fc736aff8fd-0. INFO 03-01 19:01:29 [logger.py:42] Received request cmpl-5708fe6303f746da879712f2074aa8fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:29 [async_llm.py:261] Added request cmpl-5708fe6303f746da879712f2074aa8fe-0. INFO 03-01 19:01:31 [logger.py:42] Received request cmpl-2311cebea4a64fa6afe9672a8454d9e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:31 [async_llm.py:261] Added request cmpl-2311cebea4a64fa6afe9672a8454d9e0-0. INFO 03-01 19:01:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:01:32 [logger.py:42] Received request cmpl-6fe4d74d6afd4e7cb981752ba6e39d25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:32 [async_llm.py:261] Added request cmpl-6fe4d74d6afd4e7cb981752ba6e39d25-0. INFO 03-01 19:01:33 [logger.py:42] Received request cmpl-4b5397373e29400d886a06557cbe4c39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:33 [async_llm.py:261] Added request cmpl-4b5397373e29400d886a06557cbe4c39-0. INFO 03-01 19:01:34 [logger.py:42] Received request cmpl-5fd2a42184b74785913206ac83750915-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:34 [async_llm.py:261] Added request cmpl-5fd2a42184b74785913206ac83750915-0. INFO 03-01 19:01:35 [logger.py:42] Received request cmpl-6efe20aee6034ba995d342470cd82fa3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:35 [async_llm.py:261] Added request cmpl-6efe20aee6034ba995d342470cd82fa3-0. INFO 03-01 19:01:36 [logger.py:42] Received request cmpl-bc731133cf0c489fb4b5a29df67bddff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:36 [async_llm.py:261] Added request cmpl-bc731133cf0c489fb4b5a29df67bddff-0. INFO 03-01 19:01:38 [logger.py:42] Received request cmpl-eb1a0fb3b31341daa19c2447d0bc2285-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:38 [async_llm.py:261] Added request cmpl-eb1a0fb3b31341daa19c2447d0bc2285-0. INFO 03-01 19:01:39 [logger.py:42] Received request cmpl-9ad3289b7351421b87e9e92463d69d1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:39 [async_llm.py:261] Added request cmpl-9ad3289b7351421b87e9e92463d69d1e-0. INFO 03-01 19:01:40 [logger.py:42] Received request cmpl-abd80bf5d1074b90b317fa5f32e2bafe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:40 [async_llm.py:261] Added request cmpl-abd80bf5d1074b90b317fa5f32e2bafe-0. INFO 03-01 19:01:41 [logger.py:42] Received request cmpl-5777fc6161bb4dfd81db8ade5903f97e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:41 [async_llm.py:261] Added request cmpl-5777fc6161bb4dfd81db8ade5903f97e-0. INFO 03-01 19:01:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:01:42 [logger.py:42] Received request cmpl-1aea58574cce4635a7fc35563fe5a64b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:42 [async_llm.py:261] Added request cmpl-1aea58574cce4635a7fc35563fe5a64b-0. INFO 03-01 19:01:44 [logger.py:42] Received request cmpl-a62d69c36aeb45f99379a963e42391fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:44 [async_llm.py:261] Added request cmpl-a62d69c36aeb45f99379a963e42391fd-0. INFO 03-01 19:01:45 [logger.py:42] Received request cmpl-afcf013cd8c046619709e17643acb3f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:45 [async_llm.py:261] Added request cmpl-afcf013cd8c046619709e17643acb3f1-0. INFO 03-01 19:01:46 [logger.py:42] Received request cmpl-a711a7df0af746dd961dc6c7b9016649-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:46 [async_llm.py:261] Added request cmpl-a711a7df0af746dd961dc6c7b9016649-0. INFO 03-01 19:01:47 [logger.py:42] Received request cmpl-56f333dcf3a847c7bed6d3e49c272035-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:47 [async_llm.py:261] Added request cmpl-56f333dcf3a847c7bed6d3e49c272035-0. INFO 03-01 19:01:48 [logger.py:42] Received request cmpl-15049869418044eaaaf81ac6355cf9b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:48 [async_llm.py:261] Added request cmpl-15049869418044eaaaf81ac6355cf9b6-0. INFO 03-01 19:01:49 [logger.py:42] Received request cmpl-2a2b38a86acc4a2ca42bad5afb320ea0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:49 [async_llm.py:261] Added request cmpl-2a2b38a86acc4a2ca42bad5afb320ea0-0. INFO 03-01 19:01:51 [logger.py:42] Received request cmpl-a08008fc96b24f46abd4399ce03bab5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:51 [async_llm.py:261] Added request cmpl-a08008fc96b24f46abd4399ce03bab5b-0. INFO 03-01 19:01:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:01:52 [logger.py:42] Received request cmpl-ebc34b4a24ed4d22a8da6936e70bd29c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:52 [async_llm.py:261] Added request cmpl-ebc34b4a24ed4d22a8da6936e70bd29c-0. INFO 03-01 19:01:53 [logger.py:42] Received request cmpl-fd68a0baf7d44c8cb8ac6085f7e6ada1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:53 [async_llm.py:261] Added request cmpl-fd68a0baf7d44c8cb8ac6085f7e6ada1-0. INFO 03-01 19:01:54 [logger.py:42] Received request cmpl-d7ceb90dd17d457eb9e5a9dadc5fbe4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:54 [async_llm.py:261] Added request cmpl-d7ceb90dd17d457eb9e5a9dadc5fbe4e-0. INFO 03-01 19:01:55 [logger.py:42] Received request cmpl-35e0ae47ad654de38100143c7f83063e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:55 [async_llm.py:261] Added request cmpl-35e0ae47ad654de38100143c7f83063e-0. INFO 03-01 19:01:56 [logger.py:42] Received request cmpl-505b96d465f94b7395e88956f3c62bdd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:56 [async_llm.py:261] Added request cmpl-505b96d465f94b7395e88956f3c62bdd-0. INFO 03-01 19:01:58 [logger.py:42] Received request cmpl-afee2bd9db334d3c9682186f00e50cd5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:58 [async_llm.py:261] Added request cmpl-afee2bd9db334d3c9682186f00e50cd5-0. INFO 03-01 19:01:59 [logger.py:42] Received request cmpl-a931e8e7e595442a810377851dc282e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:01:59 [async_llm.py:261] Added request cmpl-a931e8e7e595442a810377851dc282e5-0. INFO 03-01 19:02:00 [logger.py:42] Received request cmpl-1490992dcccf494bb82f4b8d7fe2ac15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:00 [async_llm.py:261] Added request cmpl-1490992dcccf494bb82f4b8d7fe2ac15-0. INFO 03-01 19:02:01 [logger.py:42] Received request cmpl-2d05b54d26604f2ca910ce09d2cb2083-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:01 [async_llm.py:261] Added request cmpl-2d05b54d26604f2ca910ce09d2cb2083-0. INFO 03-01 19:02:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:02:02 [logger.py:42] Received request cmpl-4bc1bb9106c84adaaf8584b7c756f7b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:02 [async_llm.py:261] Added request cmpl-4bc1bb9106c84adaaf8584b7c756f7b6-0. INFO 03-01 19:02:04 [logger.py:42] Received request cmpl-a7a0b27cf2974795b5c240ab959f3b2c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:04 [async_llm.py:261] Added request cmpl-a7a0b27cf2974795b5c240ab959f3b2c-0. INFO 03-01 19:02:05 [logger.py:42] Received request cmpl-42cf72496c254ddea8114bfda95ba477-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:05 [async_llm.py:261] Added request cmpl-42cf72496c254ddea8114bfda95ba477-0. INFO 03-01 19:02:06 [logger.py:42] Received request cmpl-f9ca6f3f3d4344d0a3ba78eb9df2124e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:06 [async_llm.py:261] Added request cmpl-f9ca6f3f3d4344d0a3ba78eb9df2124e-0. INFO 03-01 19:02:07 [logger.py:42] Received request cmpl-d6c09583081849cdb61aa14a83a73b90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:07 [async_llm.py:261] Added request cmpl-d6c09583081849cdb61aa14a83a73b90-0. INFO 03-01 19:02:08 [logger.py:42] Received request cmpl-38e14429a04143c88632c3cb4c360bb2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:08 [async_llm.py:261] Added request cmpl-38e14429a04143c88632c3cb4c360bb2-0. INFO 03-01 19:02:09 [logger.py:42] Received request cmpl-973b8b201f7a48cab5bdf0d8a866a91d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:09 [async_llm.py:261] Added request cmpl-973b8b201f7a48cab5bdf0d8a866a91d-0. INFO 03-01 19:02:11 [logger.py:42] Received request cmpl-908aaaf852d14cc29f146b66c42efab1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:11 [async_llm.py:261] Added request cmpl-908aaaf852d14cc29f146b66c42efab1-0. INFO 03-01 19:02:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:02:12 [logger.py:42] Received request cmpl-1ce56e749bb34be3a641574c281ca82f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:12 [async_llm.py:261] Added request cmpl-1ce56e749bb34be3a641574c281ca82f-0. INFO 03-01 19:02:13 [logger.py:42] Received request cmpl-612e8b894996405b8b038b87923b1be7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:13 [async_llm.py:261] Added request cmpl-612e8b894996405b8b038b87923b1be7-0. INFO 03-01 19:02:14 [logger.py:42] Received request cmpl-41f8317df4fa4a66924304ba22afa90d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:14 [async_llm.py:261] Added request cmpl-41f8317df4fa4a66924304ba22afa90d-0. INFO 03-01 19:02:15 [logger.py:42] Received request cmpl-6c183974b4934490bf5a5e01dbe0955b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:15 [async_llm.py:261] Added request cmpl-6c183974b4934490bf5a5e01dbe0955b-0. INFO 03-01 19:02:16 [logger.py:42] Received request cmpl-45d5e1eaa266413992dc9721bccb6c1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:16 [async_llm.py:261] Added request cmpl-45d5e1eaa266413992dc9721bccb6c1a-0. INFO 03-01 19:02:18 [logger.py:42] Received request cmpl-ec901acdc6fc4817a12485ed72a5a52c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:18 [async_llm.py:261] Added request cmpl-ec901acdc6fc4817a12485ed72a5a52c-0. INFO 03-01 19:02:19 [logger.py:42] Received request cmpl-496be807aa094c2490176b2c1137e0ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:19 [async_llm.py:261] Added request cmpl-496be807aa094c2490176b2c1137e0ed-0. INFO 03-01 19:02:20 [logger.py:42] Received request cmpl-b0edf55df616407da7054a1d9a711e5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:20 [async_llm.py:261] Added request cmpl-b0edf55df616407da7054a1d9a711e5b-0. INFO 03-01 19:02:21 [logger.py:42] Received request cmpl-008af3f272f1409aad11e58f3325fc0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:21 [async_llm.py:261] Added request cmpl-008af3f272f1409aad11e58f3325fc0b-0. INFO 03-01 19:02:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:02:22 [logger.py:42] Received request cmpl-d742bfab261a49589ff2357a6b7e018a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:22 [async_llm.py:261] Added request cmpl-d742bfab261a49589ff2357a6b7e018a-0. INFO 03-01 19:02:23 [logger.py:42] Received request cmpl-d24162361ebb4ffeb5b691c202eb18e0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:23 [async_llm.py:261] Added request cmpl-d24162361ebb4ffeb5b691c202eb18e0-0. INFO 03-01 19:02:24 [logger.py:42] Received request cmpl-d2653e9b2559497aba05658fa970bee8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:24 [async_llm.py:261] Added request cmpl-d2653e9b2559497aba05658fa970bee8-0. INFO 03-01 19:02:26 [logger.py:42] Received request cmpl-7a25b7bf97f545fabad33ab409c04031-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:26 [async_llm.py:261] Added request cmpl-7a25b7bf97f545fabad33ab409c04031-0. INFO 03-01 19:02:27 [logger.py:42] Received request cmpl-01f41ddaf4614a5a8bcdaff2de8bf4ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:27 [async_llm.py:261] Added request cmpl-01f41ddaf4614a5a8bcdaff2de8bf4ed-0. INFO 03-01 19:02:28 [logger.py:42] Received request cmpl-c3b2123c0d5c46a9a870130a05d6ddd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:28 [async_llm.py:261] Added request cmpl-c3b2123c0d5c46a9a870130a05d6ddd1-0. INFO 03-01 19:02:29 [logger.py:42] Received request cmpl-6493466a8ca24849a0029636c6a86123-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:29 [async_llm.py:261] Added request cmpl-6493466a8ca24849a0029636c6a86123-0. INFO 03-01 19:02:30 [logger.py:42] Received request cmpl-c0dd95af91f442a9b90e64b1adb89019-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:30 [async_llm.py:261] Added request cmpl-c0dd95af91f442a9b90e64b1adb89019-0. INFO 03-01 19:02:31 [logger.py:42] Received request cmpl-2ad0306800c045da8671e0e25de8866d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:31 [async_llm.py:261] Added request cmpl-2ad0306800c045da8671e0e25de8866d-0. INFO 03-01 19:02:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:02:33 [logger.py:42] Received request cmpl-7075dc41cf9e4672ababfc1dc12978c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:33 [async_llm.py:261] Added request cmpl-7075dc41cf9e4672ababfc1dc12978c3-0. INFO 03-01 19:02:34 [logger.py:42] Received request cmpl-6ad4c27b5575403faf4af054a26a08be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:34 [async_llm.py:261] Added request cmpl-6ad4c27b5575403faf4af054a26a08be-0. INFO 03-01 19:02:35 [logger.py:42] Received request cmpl-f9c90308d1a04139bab912c0a1961047-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:35 [async_llm.py:261] Added request cmpl-f9c90308d1a04139bab912c0a1961047-0. INFO 03-01 19:02:36 [logger.py:42] Received request cmpl-64dc3e1c3a194755aee36ae49d6ec4d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:36 [async_llm.py:261] Added request cmpl-64dc3e1c3a194755aee36ae49d6ec4d9-0. INFO 03-01 19:02:37 [logger.py:42] Received request cmpl-2f2505bfc5d1432a8d444ca2d30d2f10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:37 [async_llm.py:261] Added request cmpl-2f2505bfc5d1432a8d444ca2d30d2f10-0. INFO 03-01 19:02:38 [logger.py:42] Received request cmpl-0ed96e012dbf4462a9c5e1dc9d19ddd0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:38 [async_llm.py:261] Added request cmpl-0ed96e012dbf4462a9c5e1dc9d19ddd0-0. INFO 03-01 19:02:40 [logger.py:42] Received request cmpl-a0caa331d21a4304813bc4c31f702df7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:40 [async_llm.py:261] Added request cmpl-a0caa331d21a4304813bc4c31f702df7-0. INFO 03-01 19:02:41 [logger.py:42] Received request cmpl-1c4c8fbae6ed4cacb6af2aaede832619-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:41 [async_llm.py:261] Added request cmpl-1c4c8fbae6ed4cacb6af2aaede832619-0. INFO 03-01 19:02:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:02:42 [logger.py:42] Received request cmpl-53e5f7f733264e05bb1424e19b7e0c35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:42 [async_llm.py:261] Added request cmpl-53e5f7f733264e05bb1424e19b7e0c35-0. INFO 03-01 19:02:43 [logger.py:42] Received request cmpl-66b7e32b2dcd4d46a2627a8851fbf4a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:43 [async_llm.py:261] Added request cmpl-66b7e32b2dcd4d46a2627a8851fbf4a2-0. INFO 03-01 19:02:44 [logger.py:42] Received request cmpl-7823a1cd4bbc45dc8d0d3f17c25d27c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:44 [async_llm.py:261] Added request cmpl-7823a1cd4bbc45dc8d0d3f17c25d27c9-0. INFO 03-01 19:02:45 [logger.py:42] Received request cmpl-3037dd52db0a441fb180ead1b0939c8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:45 [async_llm.py:261] Added request cmpl-3037dd52db0a441fb180ead1b0939c8e-0. INFO 03-01 19:02:47 [logger.py:42] Received request cmpl-7e2ed0cd97d34782856480edbcac905b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:47 [async_llm.py:261] Added request cmpl-7e2ed0cd97d34782856480edbcac905b-0. INFO 03-01 19:02:48 [logger.py:42] Received request cmpl-eaadd3ed7b2245548a4f9529ee4e09d4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:48 [async_llm.py:261] Added request cmpl-eaadd3ed7b2245548a4f9529ee4e09d4-0. INFO 03-01 19:02:49 [logger.py:42] Received request cmpl-c93579d49b4d4bc7b2167c84cf5ae054-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:49 [async_llm.py:261] Added request cmpl-c93579d49b4d4bc7b2167c84cf5ae054-0. INFO 03-01 19:02:50 [logger.py:42] Received request cmpl-c61fe5d1588840b4a4fccf0d740b2d6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:50 [async_llm.py:261] Added request cmpl-c61fe5d1588840b4a4fccf0d740b2d6e-0. INFO 03-01 19:02:51 [logger.py:42] Received request cmpl-1530bef0f15d4ab1804db03f5fd20c31-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:51 [async_llm.py:261] Added request cmpl-1530bef0f15d4ab1804db03f5fd20c31-0. INFO 03-01 19:02:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:02:52 [logger.py:42] Received request cmpl-ae932d44afce4ee195617ce37d8c8e33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:52 [async_llm.py:261] Added request cmpl-ae932d44afce4ee195617ce37d8c8e33-0. INFO 03-01 19:02:54 [logger.py:42] Received request cmpl-3d3f178a4ccc448e87d93c6aeae51599-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:54 [async_llm.py:261] Added request cmpl-3d3f178a4ccc448e87d93c6aeae51599-0. INFO 03-01 19:02:55 [logger.py:42] Received request cmpl-4732dc4c4ddc477e9ac3d814028f6680-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:55 [async_llm.py:261] Added request cmpl-4732dc4c4ddc477e9ac3d814028f6680-0. INFO 03-01 19:02:56 [logger.py:42] Received request cmpl-91d71d35d7e34462b3736ce7962d53a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:56 [async_llm.py:261] Added request cmpl-91d71d35d7e34462b3736ce7962d53a0-0. INFO 03-01 19:02:57 [logger.py:42] Received request cmpl-7c6e3a5cd1a84041bf0151abfb4adf94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:57 [async_llm.py:261] Added request cmpl-7c6e3a5cd1a84041bf0151abfb4adf94-0. INFO 03-01 19:02:58 [logger.py:42] Received request cmpl-c3afbe763fe84698aa813cea7ecd4fc9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:58 [async_llm.py:261] Added request cmpl-c3afbe763fe84698aa813cea7ecd4fc9-0. INFO 03-01 19:02:59 [logger.py:42] Received request cmpl-7f9b6753798348128c075e45ef9b79c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:02:59 [async_llm.py:261] Added request cmpl-7f9b6753798348128c075e45ef9b79c3-0. INFO 03-01 19:03:00 [logger.py:42] Received request cmpl-0418d5aab61d45d3bbe4dd363a139669-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:00 [async_llm.py:261] Added request cmpl-0418d5aab61d45d3bbe4dd363a139669-0. INFO 03-01 19:03:02 [logger.py:42] Received request cmpl-c27b96940bb447b1a0775bc1e4451238-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:02 [async_llm.py:261] Added request cmpl-c27b96940bb447b1a0775bc1e4451238-0. INFO 03-01 19:03:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6% INFO 03-01 19:03:03 [logger.py:42] Received request cmpl-40065e90e84945cea9f66e670cd80dd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:03 [async_llm.py:261] Added request cmpl-40065e90e84945cea9f66e670cd80dd2-0. INFO 03-01 19:03:04 [logger.py:42] Received request cmpl-90fff7c32d8c45f2b19c8c2d7872053c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:04 [async_llm.py:261] Added request cmpl-90fff7c32d8c45f2b19c8c2d7872053c-0. INFO 03-01 19:03:05 [logger.py:42] Received request cmpl-71348f5e60324e1aacddc585bb797fab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:05 [async_llm.py:261] Added request cmpl-71348f5e60324e1aacddc585bb797fab-0. INFO 03-01 19:03:06 [logger.py:42] Received request cmpl-6f4325fe4940425b9317dc67ee9845d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:06 [async_llm.py:261] Added request cmpl-6f4325fe4940425b9317dc67ee9845d7-0. INFO 03-01 19:03:07 [logger.py:42] Received request cmpl-d23fc9779dc2447f8e6db0168b4578c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:07 [async_llm.py:261] Added request cmpl-d23fc9779dc2447f8e6db0168b4578c1-0. INFO 03-01 19:03:09 [logger.py:42] Received request cmpl-7956db21e79248858786dcd4188141eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:09 [async_llm.py:261] Added request cmpl-7956db21e79248858786dcd4188141eb-0. INFO 03-01 19:03:10 [logger.py:42] Received request cmpl-b26fcddb4c794de7a1d51a983b3af2ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:10 [async_llm.py:261] Added request cmpl-b26fcddb4c794de7a1d51a983b3af2ea-0. INFO 03-01 19:03:11 [logger.py:42] Received request cmpl-88d87439732a4cf8ac024303578cd8c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:11 [async_llm.py:261] Added request cmpl-88d87439732a4cf8ac024303578cd8c0-0. INFO 03-01 19:03:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:03:12 [logger.py:42] Received request cmpl-a55a72a5a5c24df3ae767a45786b0c8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:12 [async_llm.py:261] Added request cmpl-a55a72a5a5c24df3ae767a45786b0c8a-0. INFO 03-01 19:03:13 [logger.py:42] Received request cmpl-f72f21074e4148f4b2eb424317b1b2a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:13 [async_llm.py:261] Added request cmpl-f72f21074e4148f4b2eb424317b1b2a7-0. INFO 03-01 19:03:14 [logger.py:42] Received request cmpl-4f47d38232894227b7958719d1ce8f98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:14 [async_llm.py:261] Added request cmpl-4f47d38232894227b7958719d1ce8f98-0. INFO 03-01 19:03:16 [logger.py:42] Received request cmpl-634aa8d037eb40ce9100c571ae26f64c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:16 [async_llm.py:261] Added request cmpl-634aa8d037eb40ce9100c571ae26f64c-0. INFO 03-01 19:03:17 [logger.py:42] Received request cmpl-a0b2d7e595814df38a7d1e3c85605815-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:17 [async_llm.py:261] Added request cmpl-a0b2d7e595814df38a7d1e3c85605815-0. INFO 03-01 19:03:18 [logger.py:42] Received request cmpl-560f811b232a475285b5b5f7ef29e0df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:18 [async_llm.py:261] Added request cmpl-560f811b232a475285b5b5f7ef29e0df-0. INFO 03-01 19:03:19 [logger.py:42] Received request cmpl-0581c72b4cc747a5bfb2aa533c114ea8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:19 [async_llm.py:261] Added request cmpl-0581c72b4cc747a5bfb2aa533c114ea8-0. INFO 03-01 19:03:20 [logger.py:42] Received request cmpl-f67598e70e524429b63084b7ff1be68a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:20 [async_llm.py:261] Added request cmpl-f67598e70e524429b63084b7ff1be68a-0. INFO 03-01 19:03:21 [logger.py:42] Received request cmpl-85a8a895f0004845a9afb2c9f2b45add-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:21 [async_llm.py:261] Added request cmpl-85a8a895f0004845a9afb2c9f2b45add-0. INFO 03-01 19:03:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:03:23 [logger.py:42] Received request cmpl-4234b18c52854faba74d4a2276b20226-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:23 [async_llm.py:261] Added request cmpl-4234b18c52854faba74d4a2276b20226-0. INFO 03-01 19:03:24 [logger.py:42] Received request cmpl-4f863a7b86ce45b0be8e2f586a61cbc0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:24 [async_llm.py:261] Added request cmpl-4f863a7b86ce45b0be8e2f586a61cbc0-0. INFO 03-01 19:03:25 [logger.py:42] Received request cmpl-397deee5bb564a589b9378e73e923181-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:25 [async_llm.py:261] Added request cmpl-397deee5bb564a589b9378e73e923181-0. INFO 03-01 19:03:26 [logger.py:42] Received request cmpl-ce90b8318444459894847f85fa1d0071-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:26 [async_llm.py:261] Added request cmpl-ce90b8318444459894847f85fa1d0071-0. INFO 03-01 19:03:27 [logger.py:42] Received request cmpl-010bb37ee04040e1b523041f69204b22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:27 [async_llm.py:261] Added request cmpl-010bb37ee04040e1b523041f69204b22-0. INFO 03-01 19:03:28 [logger.py:42] Received request cmpl-ce3d7c62f18b4745b62d877852436c77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:28 [async_llm.py:261] Added request cmpl-ce3d7c62f18b4745b62d877852436c77-0. INFO 03-01 19:03:29 [logger.py:42] Received request cmpl-e81c6d82514640c8bd23657e823d4f69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:29 [async_llm.py:261] Added request cmpl-e81c6d82514640c8bd23657e823d4f69-0. INFO 03-01 19:03:31 [logger.py:42] Received request cmpl-338740178d6943fa9b7a6387a84045f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:31 [async_llm.py:261] Added request cmpl-338740178d6943fa9b7a6387a84045f1-0. INFO 03-01 19:03:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:03:32 [logger.py:42] Received request cmpl-37b4c29eb0fd4572aa584d3c593760a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:32 [async_llm.py:261] Added request cmpl-37b4c29eb0fd4572aa584d3c593760a2-0. INFO 03-01 19:03:33 [logger.py:42] Received request cmpl-5ab790102fbe4edaad253cd1e9c7f1be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:33 [async_llm.py:261] Added request cmpl-5ab790102fbe4edaad253cd1e9c7f1be-0. INFO 03-01 19:03:34 [logger.py:42] Received request cmpl-c145e7c3e9fa422cb2d80413c4c70d30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:34 [async_llm.py:261] Added request cmpl-c145e7c3e9fa422cb2d80413c4c70d30-0. INFO 03-01 19:03:35 [logger.py:42] Received request cmpl-a01c94c02a7d4c6185e21b954554c66b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:35 [async_llm.py:261] Added request cmpl-a01c94c02a7d4c6185e21b954554c66b-0. INFO 03-01 19:03:36 [logger.py:42] Received request cmpl-6bd4c2ddf164499681843975220b481c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:36 [async_llm.py:261] Added request cmpl-6bd4c2ddf164499681843975220b481c-0. INFO 03-01 19:03:38 [logger.py:42] Received request cmpl-20a633ad42cc4049b2aa540709ee385a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:38 [async_llm.py:261] Added request cmpl-20a633ad42cc4049b2aa540709ee385a-0. INFO 03-01 19:03:39 [logger.py:42] Received request cmpl-9c93249c806e40dba8e15aefeaa1d33e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:39 [async_llm.py:261] Added request cmpl-9c93249c806e40dba8e15aefeaa1d33e-0. INFO 03-01 19:03:40 [logger.py:42] Received request cmpl-10e4dc5ddd8141aca9e403d2edc1f335-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:40 [async_llm.py:261] Added request cmpl-10e4dc5ddd8141aca9e403d2edc1f335-0. INFO 03-01 19:03:41 [logger.py:42] Received request cmpl-0b4c1394a04048cf831dadea0f397aca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:41 [async_llm.py:261] Added request cmpl-0b4c1394a04048cf831dadea0f397aca-0. INFO 03-01 19:03:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:03:42 [logger.py:42] Received request cmpl-c40b8c40617f46a8aaf5256c9ac37e8f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:42 [async_llm.py:261] Added request cmpl-c40b8c40617f46a8aaf5256c9ac37e8f-0. INFO 03-01 19:03:43 [logger.py:42] Received request cmpl-d4764d0cc8394e739b79ecde496a8c78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:43 [async_llm.py:261] Added request cmpl-d4764d0cc8394e739b79ecde496a8c78-0. INFO 03-01 19:03:45 [logger.py:42] Received request cmpl-5eca7cc45cf142a098462a48bb681c1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:45 [async_llm.py:261] Added request cmpl-5eca7cc45cf142a098462a48bb681c1d-0. INFO 03-01 19:03:46 [logger.py:42] Received request cmpl-da3d06bf95aa4f27bad42bfd76d70736-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:46 [async_llm.py:261] Added request cmpl-da3d06bf95aa4f27bad42bfd76d70736-0. INFO 03-01 19:03:47 [logger.py:42] Received request cmpl-f1684d3cc9454cd996ccd34bbdcdf2c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:47 [async_llm.py:261] Added request cmpl-f1684d3cc9454cd996ccd34bbdcdf2c1-0. INFO 03-01 19:03:48 [logger.py:42] Received request cmpl-1ef2c1f8fb2c41e5bd166f7386a53bcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:48 [async_llm.py:261] Added request cmpl-1ef2c1f8fb2c41e5bd166f7386a53bcc-0. INFO 03-01 19:03:49 [logger.py:42] Received request cmpl-92023d724bde465fb6237fc78a9545f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:49 [async_llm.py:261] Added request cmpl-92023d724bde465fb6237fc78a9545f7-0. INFO 03-01 19:03:50 [logger.py:42] Received request cmpl-6262b0bc76e44f248a962f017c1ae5ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:50 [async_llm.py:261] Added request cmpl-6262b0bc76e44f248a962f017c1ae5ba-0. INFO 03-01 19:03:52 [logger.py:42] Received request cmpl-28380978df764ed0a515b1e3bc005071-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:52 [async_llm.py:261] Added request cmpl-28380978df764ed0a515b1e3bc005071-0. INFO 03-01 19:03:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:03:53 [logger.py:42] Received request cmpl-7cf1efd7f5ec4863803eaa65f5d0bb2f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:53 [async_llm.py:261] Added request cmpl-7cf1efd7f5ec4863803eaa65f5d0bb2f-0. INFO 03-01 19:03:54 [logger.py:42] Received request cmpl-5cef4937cd12406ab5fb64744a8e208f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:54 [async_llm.py:261] Added request cmpl-5cef4937cd12406ab5fb64744a8e208f-0. INFO 03-01 19:03:55 [logger.py:42] Received request cmpl-2bdde4a7687f46aab877473f193433d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:55 [async_llm.py:261] Added request cmpl-2bdde4a7687f46aab877473f193433d0-0. INFO 03-01 19:03:56 [logger.py:42] Received request cmpl-7e8938c3cfd944f88062e4cbf8ae6ad0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:56 [async_llm.py:261] Added request cmpl-7e8938c3cfd944f88062e4cbf8ae6ad0-0. INFO 03-01 19:03:57 [logger.py:42] Received request cmpl-1ab33297f8dd4deba6744e5f59170293-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:57 [async_llm.py:261] Added request cmpl-1ab33297f8dd4deba6744e5f59170293-0. INFO 03-01 19:03:58 [logger.py:42] Received request cmpl-90dbf59d3cbd443689545175450fcd71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:03:58 [async_llm.py:261] Added request cmpl-90dbf59d3cbd443689545175450fcd71-0. INFO 03-01 19:04:00 [logger.py:42] Received request cmpl-8e1d0c2ca95f4e04af135d27bed83aa8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:00 [async_llm.py:261] Added request cmpl-8e1d0c2ca95f4e04af135d27bed83aa8-0. INFO 03-01 19:04:01 [logger.py:42] Received request cmpl-0bdd6bee78b540ecbfb9a56a44cfdf9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:01 [async_llm.py:261] Added request cmpl-0bdd6bee78b540ecbfb9a56a44cfdf9a-0. INFO 03-01 19:04:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:04:02 [logger.py:42] Received request cmpl-72774036ef55426382a8ed591deb85d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:02 [async_llm.py:261] Added request cmpl-72774036ef55426382a8ed591deb85d7-0. INFO 03-01 19:04:03 [logger.py:42] Received request cmpl-5a7b31a62beb4473af7bc568b668cc92-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:03 [async_llm.py:261] Added request cmpl-5a7b31a62beb4473af7bc568b668cc92-0. INFO 03-01 19:04:04 [logger.py:42] Received request cmpl-ef36f39ab47a4b77b16db1120b414b2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:04 [async_llm.py:261] Added request cmpl-ef36f39ab47a4b77b16db1120b414b2b-0. INFO 03-01 19:04:05 [logger.py:42] Received request cmpl-d478cea84cf844658682de7ce41093b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:05 [async_llm.py:261] Added request cmpl-d478cea84cf844658682de7ce41093b1-0. INFO 03-01 19:04:07 [logger.py:42] Received request cmpl-a1e079c10fbc418482f0fb086b83e40f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:07 [async_llm.py:261] Added request cmpl-a1e079c10fbc418482f0fb086b83e40f-0. INFO 03-01 19:04:08 [logger.py:42] Received request cmpl-a3a888a24efe4ef4a7a90549fe988217-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:08 [async_llm.py:261] Added request cmpl-a3a888a24efe4ef4a7a90549fe988217-0. INFO 03-01 19:04:09 [logger.py:42] Received request cmpl-4792be70c23340be9605d9cfd8e80880-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:09 [async_llm.py:261] Added request cmpl-4792be70c23340be9605d9cfd8e80880-0. INFO 03-01 19:04:10 [logger.py:42] Received request cmpl-1c5b5542463743f9a01a7b0287ac7f87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:10 [async_llm.py:261] Added request cmpl-1c5b5542463743f9a01a7b0287ac7f87-0. INFO 03-01 19:04:11 [logger.py:42] Received request cmpl-743f9cab2bc54b42adec5a77a1081334-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:11 [async_llm.py:261] Added request cmpl-743f9cab2bc54b42adec5a77a1081334-0. INFO 03-01 19:04:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:04:12 [logger.py:42] Received request cmpl-f5867c91c2fe43eb8c50f1ac0a60583a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:12 [async_llm.py:261] Added request cmpl-f5867c91c2fe43eb8c50f1ac0a60583a-0. INFO 03-01 19:04:14 [logger.py:42] Received request cmpl-2aaca6e07a65473d9bed28c53207359b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:14 [async_llm.py:261] Added request cmpl-2aaca6e07a65473d9bed28c53207359b-0. INFO 03-01 19:04:15 [logger.py:42] Received request cmpl-3521c934b6b94bafaed6b4dde92b0660-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:15 [async_llm.py:261] Added request cmpl-3521c934b6b94bafaed6b4dde92b0660-0. INFO 03-01 19:04:16 [logger.py:42] Received request cmpl-27e24b33bd01408fb0b998d93bf14328-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:16 [async_llm.py:261] Added request cmpl-27e24b33bd01408fb0b998d93bf14328-0. INFO 03-01 19:04:17 [logger.py:42] Received request cmpl-e18c223e8e894fca98254a4084393017-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:17 [async_llm.py:261] Added request cmpl-e18c223e8e894fca98254a4084393017-0. INFO 03-01 19:04:18 [logger.py:42] Received request cmpl-36bffa84ceac430787e87c442a5a5056-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:18 [async_llm.py:261] Added request cmpl-36bffa84ceac430787e87c442a5a5056-0. INFO 03-01 19:04:19 [logger.py:42] Received request cmpl-078d3393b50c491d813b7a51558e8f30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:19 [async_llm.py:261] Added request cmpl-078d3393b50c491d813b7a51558e8f30-0. INFO 03-01 19:04:20 [logger.py:42] Received request cmpl-afb2e63e9b2642d6ad6db973a2d55a2d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:20 [async_llm.py:261] Added request cmpl-afb2e63e9b2642d6ad6db973a2d55a2d-0. INFO 03-01 19:04:22 [logger.py:42] Received request cmpl-4224a30360004eda9e3a9b6f28d41f10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:22 [async_llm.py:261] Added request cmpl-4224a30360004eda9e3a9b6f28d41f10-0. INFO 03-01 19:04:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:04:23 [logger.py:42] Received request cmpl-2f227a288a3245a1a8de42d503587d0b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:23 [async_llm.py:261] Added request cmpl-2f227a288a3245a1a8de42d503587d0b-0. INFO 03-01 19:04:24 [logger.py:42] Received request cmpl-d64f33c80be34cc3ae2ba9ca6a00834d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:24 [async_llm.py:261] Added request cmpl-d64f33c80be34cc3ae2ba9ca6a00834d-0. INFO 03-01 19:04:25 [logger.py:42] Received request cmpl-49c3883c27ef4a10b8be05771528b0b4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:25 [async_llm.py:261] Added request cmpl-49c3883c27ef4a10b8be05771528b0b4-0. INFO 03-01 19:04:26 [logger.py:42] Received request cmpl-512fa89107df444483995e18100710b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:26 [async_llm.py:261] Added request cmpl-512fa89107df444483995e18100710b0-0. INFO 03-01 19:04:27 [logger.py:42] Received request cmpl-7b59d1a50d104dbf98c9e1425a22308d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:27 [async_llm.py:261] Added request cmpl-7b59d1a50d104dbf98c9e1425a22308d-0. INFO 03-01 19:04:29 [logger.py:42] Received request cmpl-4e4e164a8194466f8b7e027d167b757f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:29 [async_llm.py:261] Added request cmpl-4e4e164a8194466f8b7e027d167b757f-0. INFO 03-01 19:04:30 [logger.py:42] Received request cmpl-ef8d643b71f2489fa710776c30202810-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:30 [async_llm.py:261] Added request cmpl-ef8d643b71f2489fa710776c30202810-0. INFO 03-01 19:04:31 [logger.py:42] Received request cmpl-d2ee3cc834bc456dbf1c63e24fd1da55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:31 [async_llm.py:261] Added request cmpl-d2ee3cc834bc456dbf1c63e24fd1da55-0. INFO 03-01 19:04:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:04:32 [logger.py:42] Received request cmpl-960abeeab4fc40c5862a4c98d31765be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:32 [async_llm.py:261] Added request cmpl-960abeeab4fc40c5862a4c98d31765be-0. INFO 03-01 19:04:33 [logger.py:42] Received request cmpl-531fd42396914ebe990c7dad8e6da7bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:33 [async_llm.py:261] Added request cmpl-531fd42396914ebe990c7dad8e6da7bd-0. INFO 03-01 19:04:34 [logger.py:42] Received request cmpl-faca0b64de4d45b79793efdef87ced88-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:34 [async_llm.py:261] Added request cmpl-faca0b64de4d45b79793efdef87ced88-0. INFO 03-01 19:04:36 [logger.py:42] Received request cmpl-f6b0f02d656a4408bc8dece2390f2a59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:36 [async_llm.py:261] Added request cmpl-f6b0f02d656a4408bc8dece2390f2a59-0. INFO 03-01 19:04:37 [logger.py:42] Received request cmpl-438047e71b854100886894eba220fd9a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:37 [async_llm.py:261] Added request cmpl-438047e71b854100886894eba220fd9a-0. INFO 03-01 19:04:38 [logger.py:42] Received request cmpl-b1904c45ffda40cf9f45c6527866fe52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:38 [async_llm.py:261] Added request cmpl-b1904c45ffda40cf9f45c6527866fe52-0. INFO 03-01 19:04:39 [logger.py:42] Received request cmpl-a7997d6b35104bd3bfb43807d6a59128-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:39 [async_llm.py:261] Added request cmpl-a7997d6b35104bd3bfb43807d6a59128-0. INFO 03-01 19:04:40 [logger.py:42] Received request cmpl-06ef27e177be4371bca24c39dfb0acf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:40 [async_llm.py:261] Added request cmpl-06ef27e177be4371bca24c39dfb0acf6-0. INFO 03-01 19:04:41 [logger.py:42] Received request cmpl-29d25c7ceb7842d78c786447b1ea36f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:41 [async_llm.py:261] Added request cmpl-29d25c7ceb7842d78c786447b1ea36f1-0. INFO 03-01 19:04:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:04:43 [logger.py:42] Received request cmpl-ebf0c7bd07c147cda011c2f7d1598c90-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:43 [async_llm.py:261] Added request cmpl-ebf0c7bd07c147cda011c2f7d1598c90-0. INFO 03-01 19:04:44 [logger.py:42] Received request cmpl-ba3051a7e4d0464eb082fca4b6402d8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:44 [async_llm.py:261] Added request cmpl-ba3051a7e4d0464eb082fca4b6402d8a-0. INFO 03-01 19:04:45 [logger.py:42] Received request cmpl-76a68c9aaebc4bc18fa8424e8a6c1890-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:45 [async_llm.py:261] Added request cmpl-76a68c9aaebc4bc18fa8424e8a6c1890-0. INFO 03-01 19:04:46 [logger.py:42] Received request cmpl-8af73b448ae44404843608cf00af847e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:46 [async_llm.py:261] Added request cmpl-8af73b448ae44404843608cf00af847e-0. INFO 03-01 19:04:47 [logger.py:42] Received request cmpl-2ff504318fd94ce0a374b7b1163fc488-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:47 [async_llm.py:261] Added request cmpl-2ff504318fd94ce0a374b7b1163fc488-0. INFO 03-01 19:04:48 [logger.py:42] Received request cmpl-8010960cd8b44b828636bc1d446df465-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:48 [async_llm.py:261] Added request cmpl-8010960cd8b44b828636bc1d446df465-0. INFO 03-01 19:04:49 [logger.py:42] Received request cmpl-7b09f9022c5949e5835b120119e94917-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:49 [async_llm.py:261] Added request cmpl-7b09f9022c5949e5835b120119e94917-0. INFO 03-01 19:04:51 [logger.py:42] Received request cmpl-16590d4f5b49481291de6bd141523973-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:51 [async_llm.py:261] Added request cmpl-16590d4f5b49481291de6bd141523973-0. INFO 03-01 19:04:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:04:52 [logger.py:42] Received request cmpl-b081279e330349a994ce8cce6d807303-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:52 [async_llm.py:261] Added request cmpl-b081279e330349a994ce8cce6d807303-0. INFO 03-01 19:04:53 [logger.py:42] Received request cmpl-6ace85d8d92e4b03aa66ad7d43cd0391-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:53 [async_llm.py:261] Added request cmpl-6ace85d8d92e4b03aa66ad7d43cd0391-0. INFO 03-01 19:04:54 [logger.py:42] Received request cmpl-e0001d09bae74e9cb473a6b784bf0904-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:54 [async_llm.py:261] Added request cmpl-e0001d09bae74e9cb473a6b784bf0904-0. INFO 03-01 19:04:55 [logger.py:42] Received request cmpl-9892896d1bff41f98faf186267795a33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:55 [async_llm.py:261] Added request cmpl-9892896d1bff41f98faf186267795a33-0. INFO 03-01 19:04:56 [logger.py:42] Received request cmpl-9402c67706bd4dc3b7f40e910dadfbd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:56 [async_llm.py:261] Added request cmpl-9402c67706bd4dc3b7f40e910dadfbd1-0. INFO 03-01 19:04:58 [logger.py:42] Received request cmpl-7c9f4601c4e8414998cf6bc5b84f59af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:58 [async_llm.py:261] Added request cmpl-7c9f4601c4e8414998cf6bc5b84f59af-0. INFO 03-01 19:04:59 [logger.py:42] Received request cmpl-ab494b6506484551b02cddbdda2094ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:04:59 [async_llm.py:261] Added request cmpl-ab494b6506484551b02cddbdda2094ad-0. INFO 03-01 19:05:00 [logger.py:42] Received request cmpl-7e52422730df42479c73f658864239a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:00 [async_llm.py:261] Added request cmpl-7e52422730df42479c73f658864239a6-0. INFO 03-01 19:05:01 [logger.py:42] Received request cmpl-e8fed74037e24262a335202353b223a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:01 [async_llm.py:261] Added request cmpl-e8fed74037e24262a335202353b223a7-0. INFO 03-01 19:05:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:05:02 [logger.py:42] Received request cmpl-292256f090314cad87cefa3ea5a12f1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:02 [async_llm.py:261] Added request cmpl-292256f090314cad87cefa3ea5a12f1f-0. INFO 03-01 19:05:03 [logger.py:42] Received request cmpl-ce5d4219cbf044c5a03c9f4d8deda03f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:03 [async_llm.py:261] Added request cmpl-ce5d4219cbf044c5a03c9f4d8deda03f-0. INFO 03-01 19:05:05 [logger.py:42] Received request cmpl-0c493cdd60344f3f95281ee3af7fc6d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:05 [async_llm.py:261] Added request cmpl-0c493cdd60344f3f95281ee3af7fc6d1-0. INFO 03-01 19:05:06 [logger.py:42] Received request cmpl-ae715c2fd3c743b298497d7fe6e7e69c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:06 [async_llm.py:261] Added request cmpl-ae715c2fd3c743b298497d7fe6e7e69c-0. INFO 03-01 19:05:07 [logger.py:42] Received request cmpl-43db3127fadf4c088856c983207611ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:07 [async_llm.py:261] Added request cmpl-43db3127fadf4c088856c983207611ee-0. INFO 03-01 19:05:08 [logger.py:42] Received request cmpl-bdede84b087d4add97da12e0c859e5a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:08 [async_llm.py:261] Added request cmpl-bdede84b087d4add97da12e0c859e5a4-0. INFO 03-01 19:05:09 [logger.py:42] Received request cmpl-2101c80c9113410f89f00ded41d24584-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:09 [async_llm.py:261] Added request cmpl-2101c80c9113410f89f00ded41d24584-0. INFO 03-01 19:05:10 [logger.py:42] Received request cmpl-76ce732229c2451aaace7b4905989d25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:10 [async_llm.py:261] Added request cmpl-76ce732229c2451aaace7b4905989d25-0. INFO 03-01 19:05:12 [logger.py:42] Received request cmpl-be23f53cb6e6474ca39f88f4611dde19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:12 [async_llm.py:261] Added request cmpl-be23f53cb6e6474ca39f88f4611dde19-0. INFO 03-01 19:05:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:05:13 [logger.py:42] Received request cmpl-9d890fecb73f4f919a88a761d923a2eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:13 [async_llm.py:261] Added request cmpl-9d890fecb73f4f919a88a761d923a2eb-0. INFO 03-01 19:05:14 [logger.py:42] Received request cmpl-992792a2bb7947e4a1fdedd61cc99b29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:14 [async_llm.py:261] Added request cmpl-992792a2bb7947e4a1fdedd61cc99b29-0. INFO 03-01 19:05:15 [logger.py:42] Received request cmpl-8c89f21a00d1430c9b6fa254a9e91239-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:15 [async_llm.py:261] Added request cmpl-8c89f21a00d1430c9b6fa254a9e91239-0. INFO 03-01 19:05:16 [logger.py:42] Received request cmpl-8c9d714f2b7f4a978dabc0a7a5cfb852-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:16 [async_llm.py:261] Added request cmpl-8c9d714f2b7f4a978dabc0a7a5cfb852-0. INFO 03-01 19:05:17 [logger.py:42] Received request cmpl-258b59057a994d3d9357f8f9470c458b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:17 [async_llm.py:261] Added request cmpl-258b59057a994d3d9357f8f9470c458b-0. INFO 03-01 19:05:19 [logger.py:42] Received request cmpl-a65bedd9ffc64c0d8660e06aa7622a46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:19 [async_llm.py:261] Added request cmpl-a65bedd9ffc64c0d8660e06aa7622a46-0. INFO 03-01 19:05:20 [logger.py:42] Received request cmpl-3c286ad67a204734bee0ece6ba434eec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:20 [async_llm.py:261] Added request cmpl-3c286ad67a204734bee0ece6ba434eec-0. INFO 03-01 19:05:21 [logger.py:42] Received request cmpl-715e260ceaf648238abf42e1bffdc317-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:21 [async_llm.py:261] Added request cmpl-715e260ceaf648238abf42e1bffdc317-0. INFO 03-01 19:05:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:05:22 [logger.py:42] Received request cmpl-3a045e36d83f4c0a8d85b48d0c587328-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:22 [async_llm.py:261] Added request cmpl-3a045e36d83f4c0a8d85b48d0c587328-0. INFO 03-01 19:05:23 [logger.py:42] Received request cmpl-f4e300220ab0492fb29cf6ac3ec59d57-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:23 [async_llm.py:261] Added request cmpl-f4e300220ab0492fb29cf6ac3ec59d57-0. INFO 03-01 19:05:25 [logger.py:42] Received request cmpl-a10aeb984c994b3999a80e93298ae10e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:25 [async_llm.py:261] Added request cmpl-a10aeb984c994b3999a80e93298ae10e-0. INFO 03-01 19:05:26 [logger.py:42] Received request cmpl-6a23bd6527914888a69dbfa1b0ab3e7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:26 [async_llm.py:261] Added request cmpl-6a23bd6527914888a69dbfa1b0ab3e7d-0. INFO 03-01 19:05:27 [logger.py:42] Received request cmpl-25cb6ef264be463f8b694a48c11f4995-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:27 [async_llm.py:261] Added request cmpl-25cb6ef264be463f8b694a48c11f4995-0. INFO 03-01 19:05:28 [logger.py:42] Received request cmpl-254cfa97caa74da8823c350b3e6b2ef0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:28 [async_llm.py:261] Added request cmpl-254cfa97caa74da8823c350b3e6b2ef0-0. INFO 03-01 19:05:29 [logger.py:42] Received request cmpl-135b57bd18d444318bf7ec47c7318a6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:29 [async_llm.py:261] Added request cmpl-135b57bd18d444318bf7ec47c7318a6d-0. INFO 03-01 19:05:30 [logger.py:42] Received request cmpl-20f3edf747534786b87e7f05ad358a7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:30 [async_llm.py:261] Added request cmpl-20f3edf747534786b87e7f05ad358a7a-0. INFO 03-01 19:05:32 [logger.py:42] Received request cmpl-f1a1e352cffb4fb1b0b37d76d9c3cb41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:32 [async_llm.py:261] Added request cmpl-f1a1e352cffb4fb1b0b37d76d9c3cb41-0. INFO 03-01 19:05:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6% INFO 03-01 19:05:33 [logger.py:42] Received request cmpl-7e9b587ef5934999931f4bf4012cc377-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:33 [async_llm.py:261] Added request cmpl-7e9b587ef5934999931f4bf4012cc377-0. INFO 03-01 19:05:34 [logger.py:42] Received request cmpl-12a0280ec3f2451b880f0cff273cf095-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:34 [async_llm.py:261] Added request cmpl-12a0280ec3f2451b880f0cff273cf095-0. INFO 03-01 19:05:35 [logger.py:42] Received request cmpl-8beebb7f51f445dc9a110abb812b70ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:35 [async_llm.py:261] Added request cmpl-8beebb7f51f445dc9a110abb812b70ac-0. INFO 03-01 19:05:36 [logger.py:42] Received request cmpl-8920a9e3466842b0870dbe836cf29922-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:36 [async_llm.py:261] Added request cmpl-8920a9e3466842b0870dbe836cf29922-0. INFO 03-01 19:05:37 [logger.py:42] Received request cmpl-1ea8d8eb3e0b482db5e07741bed9859e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:37 [async_llm.py:261] Added request cmpl-1ea8d8eb3e0b482db5e07741bed9859e-0. INFO 03-01 19:05:39 [logger.py:42] Received request cmpl-58223c07a3534f68b5b2ecdad99ae52e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:39 [async_llm.py:261] Added request cmpl-58223c07a3534f68b5b2ecdad99ae52e-0. INFO 03-01 19:05:40 [logger.py:42] Received request cmpl-44afdb6a2bc34725b181faff920f9721-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:40 [async_llm.py:261] Added request cmpl-44afdb6a2bc34725b181faff920f9721-0. INFO 03-01 19:05:41 [logger.py:42] Received request cmpl-93e74859b26847ad959d537db14c071e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:41 [async_llm.py:261] Added request cmpl-93e74859b26847ad959d537db14c071e-0. INFO 03-01 19:05:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:05:42 [logger.py:42] Received request cmpl-6df2b9e316734a6f93b488f6acb45e04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:42 [async_llm.py:261] Added request cmpl-6df2b9e316734a6f93b488f6acb45e04-0. INFO 03-01 19:05:43 [logger.py:42] Received request cmpl-7220096a02794583a510c552dbaf3c45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:43 [async_llm.py:261] Added request cmpl-7220096a02794583a510c552dbaf3c45-0. INFO 03-01 19:05:44 [logger.py:42] Received request cmpl-684a1d7419cb40a4b925ef09677df3c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:44 [async_llm.py:261] Added request cmpl-684a1d7419cb40a4b925ef09677df3c7-0. INFO 03-01 19:05:46 [logger.py:42] Received request cmpl-1e6fc9e4b5304b91a5c5832e9587f0d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:46 [async_llm.py:261] Added request cmpl-1e6fc9e4b5304b91a5c5832e9587f0d7-0. INFO 03-01 19:05:47 [logger.py:42] Received request cmpl-2b4b2b7412bb47958b4f14f76d5ebf87-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:47 [async_llm.py:261] Added request cmpl-2b4b2b7412bb47958b4f14f76d5ebf87-0. INFO 03-01 19:05:48 [logger.py:42] Received request cmpl-39fd91c4d9c848b9a8cf854ad14e8642-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:48 [async_llm.py:261] Added request cmpl-39fd91c4d9c848b9a8cf854ad14e8642-0. INFO 03-01 19:05:49 [logger.py:42] Received request cmpl-bdbe5a37e4fc47f594e06a6aab80c537-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:49 [async_llm.py:261] Added request cmpl-bdbe5a37e4fc47f594e06a6aab80c537-0. INFO 03-01 19:05:50 [logger.py:42] Received request cmpl-24e3a731a2214fc6aee8ed6685033149-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:50 [async_llm.py:261] Added request cmpl-24e3a731a2214fc6aee8ed6685033149-0. INFO 03-01 19:05:52 [logger.py:42] Received request cmpl-6590b3df55c04f039b94eecb60e0b98f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:52 [async_llm.py:261] Added request cmpl-6590b3df55c04f039b94eecb60e0b98f-0. INFO 03-01 19:05:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:05:53 [logger.py:42] Received request cmpl-0d14deed763a4b7b91cec41825d06346-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:53 [async_llm.py:261] Added request cmpl-0d14deed763a4b7b91cec41825d06346-0. INFO 03-01 19:05:54 [logger.py:42] Received request cmpl-ad4a90e7d9654a2faad78a86db1123ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:54 [async_llm.py:261] Added request cmpl-ad4a90e7d9654a2faad78a86db1123ea-0. INFO 03-01 19:05:55 [logger.py:42] Received request cmpl-7228c681ce5f4dd59aae4b613843dda4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:55 [async_llm.py:261] Added request cmpl-7228c681ce5f4dd59aae4b613843dda4-0. INFO 03-01 19:05:56 [logger.py:42] Received request cmpl-1328e01007e143c4b4bedba648537bd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:56 [async_llm.py:261] Added request cmpl-1328e01007e143c4b4bedba648537bd8-0. INFO 03-01 19:05:57 [logger.py:42] Received request cmpl-69c993a2f0824693b0aab603c1b1f274-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:57 [async_llm.py:261] Added request cmpl-69c993a2f0824693b0aab603c1b1f274-0. INFO 03-01 19:05:59 [logger.py:42] Received request cmpl-a7d19cff352c45cb8925c11dc9d98782-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:05:59 [async_llm.py:261] Added request cmpl-a7d19cff352c45cb8925c11dc9d98782-0. INFO 03-01 19:06:00 [logger.py:42] Received request cmpl-8123f4e9bd1f47319d84b2e79652ccc5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:00 [async_llm.py:261] Added request cmpl-8123f4e9bd1f47319d84b2e79652ccc5-0. INFO 03-01 19:06:01 [logger.py:42] Received request cmpl-cc1b821344474018bf98db78c3b8310b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:01 [async_llm.py:261] Added request cmpl-cc1b821344474018bf98db78c3b8310b-0. INFO 03-01 19:06:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:06:02 [logger.py:42] Received request cmpl-ce015747116e4359a84f43c5ecb94a4a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:02 [async_llm.py:261] Added request cmpl-ce015747116e4359a84f43c5ecb94a4a-0. INFO 03-01 19:06:03 [logger.py:42] Received request cmpl-d856735e4e0c47e586e840fe4e0fc5cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:03 [async_llm.py:261] Added request cmpl-d856735e4e0c47e586e840fe4e0fc5cb-0. INFO 03-01 19:06:05 [logger.py:42] Received request cmpl-3ff26761c87348a189b7b4f2cb0dcb98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:05 [async_llm.py:261] Added request cmpl-3ff26761c87348a189b7b4f2cb0dcb98-0. INFO 03-01 19:06:06 [logger.py:42] Received request cmpl-011740a78035442d972ed524b35fac32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:06 [async_llm.py:261] Added request cmpl-011740a78035442d972ed524b35fac32-0. INFO 03-01 19:06:07 [logger.py:42] Received request cmpl-6868c0a1508c4265a5f4ad3f616bf002-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:07 [async_llm.py:261] Added request cmpl-6868c0a1508c4265a5f4ad3f616bf002-0. INFO 03-01 19:06:08 [logger.py:42] Received request cmpl-4ae2129958e74d2d866eb1f3c5ced801-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:08 [async_llm.py:261] Added request cmpl-4ae2129958e74d2d866eb1f3c5ced801-0. INFO 03-01 19:06:09 [logger.py:42] Received request cmpl-970280bea7664a478354e2f3e5579e84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:09 [async_llm.py:261] Added request cmpl-970280bea7664a478354e2f3e5579e84-0. INFO 03-01 19:06:10 [logger.py:42] Received request cmpl-1d6e9c28306c4ac18bbe29469376b25d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:10 [async_llm.py:261] Added request cmpl-1d6e9c28306c4ac18bbe29469376b25d-0. INFO 03-01 19:06:12 [logger.py:42] Received request cmpl-75f06dedbf3046e4bef0a20609595027-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:12 [async_llm.py:261] Added request cmpl-75f06dedbf3046e4bef0a20609595027-0. INFO 03-01 19:06:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:06:13 [logger.py:42] Received request cmpl-08956a82c1a54c61b37e839114609a27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:13 [async_llm.py:261] Added request cmpl-08956a82c1a54c61b37e839114609a27-0. INFO 03-01 19:06:14 [logger.py:42] Received request cmpl-6b1eb38587564b1d9e3696cd86add412-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:14 [async_llm.py:261] Added request cmpl-6b1eb38587564b1d9e3696cd86add412-0. INFO 03-01 19:06:15 [logger.py:42] Received request cmpl-08c06c71b61b4323b891fcd4212ee1c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:15 [async_llm.py:261] Added request cmpl-08c06c71b61b4323b891fcd4212ee1c0-0. INFO 03-01 19:06:16 [logger.py:42] Received request cmpl-541e837711ba43bc9c6d900aa5f344e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:16 [async_llm.py:261] Added request cmpl-541e837711ba43bc9c6d900aa5f344e3-0. INFO 03-01 19:06:17 [logger.py:42] Received request cmpl-4c1fa71fcb9a4bb49e34662059c24dba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:17 [async_llm.py:261] Added request cmpl-4c1fa71fcb9a4bb49e34662059c24dba-0. INFO 03-01 19:06:18 [logger.py:42] Received request cmpl-86681f7f7b134a2ea310319c9914b8d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:18 [async_llm.py:261] Added request cmpl-86681f7f7b134a2ea310319c9914b8d3-0. INFO 03-01 19:06:20 [logger.py:42] Received request cmpl-fd4707c3915d4a48b974f91336876772-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:20 [async_llm.py:261] Added request cmpl-fd4707c3915d4a48b974f91336876772-0. INFO 03-01 19:06:21 [logger.py:42] Received request cmpl-7711094756934642938f82e0dcc9a510-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:21 [async_llm.py:261] Added request cmpl-7711094756934642938f82e0dcc9a510-0. INFO 03-01 19:06:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:06:22 [logger.py:42] Received request cmpl-fc17fd9f8e064963b9d1175bee208a5a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:22 [async_llm.py:261] Added request cmpl-fc17fd9f8e064963b9d1175bee208a5a-0. INFO 03-01 19:06:23 [logger.py:42] Received request cmpl-4d1f28f7586547079eccc87ae24e1215-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:23 [async_llm.py:261] Added request cmpl-4d1f28f7586547079eccc87ae24e1215-0. INFO 03-01 19:06:24 [logger.py:42] Received request cmpl-f60082efc5f74031a6f838f2af55c41d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:24 [async_llm.py:261] Added request cmpl-f60082efc5f74031a6f838f2af55c41d-0. INFO 03-01 19:06:25 [logger.py:42] Received request cmpl-e0875ac4062f457da224c94dd50a5cda-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:25 [async_llm.py:261] Added request cmpl-e0875ac4062f457da224c94dd50a5cda-0. INFO 03-01 19:06:27 [logger.py:42] Received request cmpl-569643e841514621901ab9b684509449-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:27 [async_llm.py:261] Added request cmpl-569643e841514621901ab9b684509449-0. INFO 03-01 19:06:28 [logger.py:42] Received request cmpl-555919a5ebab42448a6492fcd63cfa39-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:28 [async_llm.py:261] Added request cmpl-555919a5ebab42448a6492fcd63cfa39-0. INFO 03-01 19:06:29 [logger.py:42] Received request cmpl-209e8e42a0454ec997a2fa050b4ec6d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:29 [async_llm.py:261] Added request cmpl-209e8e42a0454ec997a2fa050b4ec6d1-0. INFO 03-01 19:06:30 [logger.py:42] Received request cmpl-432afb73f5604818a0e8c53ed5d537f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:30 [async_llm.py:261] Added request cmpl-432afb73f5604818a0e8c53ed5d537f1-0. INFO 03-01 19:06:31 [logger.py:42] Received request cmpl-c230512df5a3445cb35074aa9ee3adb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:31 [async_llm.py:261] Added request cmpl-c230512df5a3445cb35074aa9ee3adb7-0. INFO 03-01 19:06:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:06:32 [logger.py:42] Received request cmpl-8c90af66624a4874ab341c50555b04a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:32 [async_llm.py:261] Added request cmpl-8c90af66624a4874ab341c50555b04a0-0. INFO 03-01 19:06:34 [logger.py:42] Received request cmpl-0da1bcd657fa44c69dc3d5e3ec6dc806-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:34 [async_llm.py:261] Added request cmpl-0da1bcd657fa44c69dc3d5e3ec6dc806-0. INFO 03-01 19:06:35 [logger.py:42] Received request cmpl-60283f91c8644087bb8bfc777378b466-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:35 [async_llm.py:261] Added request cmpl-60283f91c8644087bb8bfc777378b466-0. INFO 03-01 19:06:36 [logger.py:42] Received request cmpl-d3092c8ab4db49a0a4cbde1782fd1477-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:36 [async_llm.py:261] Added request cmpl-d3092c8ab4db49a0a4cbde1782fd1477-0. INFO 03-01 19:06:37 [logger.py:42] Received request cmpl-3e2c2e5a59ea420ebf345bcfab18e78f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:37 [async_llm.py:261] Added request cmpl-3e2c2e5a59ea420ebf345bcfab18e78f-0. INFO 03-01 19:06:38 [logger.py:42] Received request cmpl-a8509e272b2340c4bd45c5dab42413a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:38 [async_llm.py:261] Added request cmpl-a8509e272b2340c4bd45c5dab42413a8-0. INFO 03-01 19:06:39 [logger.py:42] Received request cmpl-45a8085cb20144a581376038215195b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:39 [async_llm.py:261] Added request cmpl-45a8085cb20144a581376038215195b5-0. INFO 03-01 19:06:41 [logger.py:42] Received request cmpl-2c5c0161479b468cac9bac0e08246512-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:41 [async_llm.py:261] Added request cmpl-2c5c0161479b468cac9bac0e08246512-0. INFO 03-01 19:06:42 [logger.py:42] Received request cmpl-4462594ac4fe454fade377167a092d6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:42 [async_llm.py:261] Added request cmpl-4462594ac4fe454fade377167a092d6b-0. INFO 03-01 19:06:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:06:43 [logger.py:42] Received request cmpl-d7f1b6c01c564b2ab218b2573f6dc90f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:43 [async_llm.py:261] Added request cmpl-d7f1b6c01c564b2ab218b2573f6dc90f-0. INFO 03-01 19:06:44 [logger.py:42] Received request cmpl-b8383ee9b5914c2c95797d024ed382a1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:44 [async_llm.py:261] Added request cmpl-b8383ee9b5914c2c95797d024ed382a1-0. INFO 03-01 19:06:45 [logger.py:42] Received request cmpl-e4dcc2d5f2074519bb6169a5942af88f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:45 [async_llm.py:261] Added request cmpl-e4dcc2d5f2074519bb6169a5942af88f-0. INFO 03-01 19:06:46 [logger.py:42] Received request cmpl-b2101aa195a74e469c08122d701103e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:46 [async_llm.py:261] Added request cmpl-b2101aa195a74e469c08122d701103e6-0. INFO 03-01 19:06:47 [logger.py:42] Received request cmpl-230de682c0cf407b981047cce3e85084-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:47 [async_llm.py:261] Added request cmpl-230de682c0cf407b981047cce3e85084-0. INFO 03-01 19:06:49 [logger.py:42] Received request cmpl-8c9982000e93496fb6ad5bbb83bf0720-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:49 [async_llm.py:261] Added request cmpl-8c9982000e93496fb6ad5bbb83bf0720-0. INFO 03-01 19:06:50 [logger.py:42] Received request cmpl-bceb4e6ed1804530b37956d58557a412-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:50 [async_llm.py:261] Added request cmpl-bceb4e6ed1804530b37956d58557a412-0. INFO 03-01 19:06:51 [logger.py:42] Received request cmpl-48320d1fb7174f9cab1cf5275206baa1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:51 [async_llm.py:261] Added request cmpl-48320d1fb7174f9cab1cf5275206baa1-0. INFO 03-01 19:06:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:06:52 [logger.py:42] Received request cmpl-3e54a2422954446ca51d3fc5254bbd08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:52 [async_llm.py:261] Added request cmpl-3e54a2422954446ca51d3fc5254bbd08-0. INFO 03-01 19:06:53 [logger.py:42] Received request cmpl-946c953a117d42b792bf2693a51b0407-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:53 [async_llm.py:261] Added request cmpl-946c953a117d42b792bf2693a51b0407-0. INFO 03-01 19:06:54 [logger.py:42] Received request cmpl-782a0e13be484082a189aeb900aa27fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:54 [async_llm.py:261] Added request cmpl-782a0e13be484082a189aeb900aa27fd-0. INFO 03-01 19:06:56 [logger.py:42] Received request cmpl-dec6349ff36d4695a361c80447c7e927-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:56 [async_llm.py:261] Added request cmpl-dec6349ff36d4695a361c80447c7e927-0. INFO 03-01 19:06:57 [logger.py:42] Received request cmpl-0ea2541342554fa19c38ea8ff2165833-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:57 [async_llm.py:261] Added request cmpl-0ea2541342554fa19c38ea8ff2165833-0. INFO 03-01 19:06:58 [logger.py:42] Received request cmpl-16d4de83737c48cb992a38ae046a4211-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:58 [async_llm.py:261] Added request cmpl-16d4de83737c48cb992a38ae046a4211-0. INFO 03-01 19:06:59 [logger.py:42] Received request cmpl-31832988a7f44540a4fdb5a8ffbe9350-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:06:59 [async_llm.py:261] Added request cmpl-31832988a7f44540a4fdb5a8ffbe9350-0. INFO 03-01 19:07:00 [logger.py:42] Received request cmpl-030da7e88d3e49029cd86b7e4cd56884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:00 [async_llm.py:261] Added request cmpl-030da7e88d3e49029cd86b7e4cd56884-0. INFO 03-01 19:07:01 [logger.py:42] Received request cmpl-d3e25c8df6a14bf4998267b3efe74c5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:01 [async_llm.py:261] Added request cmpl-d3e25c8df6a14bf4998267b3efe74c5b-0. INFO 03-01 19:07:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:07:03 [logger.py:42] Received request cmpl-c4ac7f43a6194b58b472dc749fd1cbc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:03 [async_llm.py:261] Added request cmpl-c4ac7f43a6194b58b472dc749fd1cbc3-0. INFO 03-01 19:07:04 [logger.py:42] Received request cmpl-a2d1616f107f444e8626b86c10cd4866-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:04 [async_llm.py:261] Added request cmpl-a2d1616f107f444e8626b86c10cd4866-0. INFO 03-01 19:07:05 [logger.py:42] Received request cmpl-1496b89165e543eb899ae3fea2a3b236-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:05 [async_llm.py:261] Added request cmpl-1496b89165e543eb899ae3fea2a3b236-0. INFO 03-01 19:07:06 [logger.py:42] Received request cmpl-8c0a58b7a014412bafac09260645eb23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:06 [async_llm.py:261] Added request cmpl-8c0a58b7a014412bafac09260645eb23-0. INFO 03-01 19:07:07 [logger.py:42] Received request cmpl-925fa2c776a240acb976c04686002ec2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:07 [async_llm.py:261] Added request cmpl-925fa2c776a240acb976c04686002ec2-0. INFO 03-01 19:07:08 [logger.py:42] Received request cmpl-e03a60a9130540bc8e97f1fb78c80be9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:08 [async_llm.py:261] Added request cmpl-e03a60a9130540bc8e97f1fb78c80be9-0. INFO 03-01 19:07:10 [logger.py:42] Received request cmpl-41bddc6b779e4fe59e6a971842451257-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:10 [async_llm.py:261] Added request cmpl-41bddc6b779e4fe59e6a971842451257-0. INFO 03-01 19:07:11 [logger.py:42] Received request cmpl-76f9d9ee5a4244859d73909d6a9c7145-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:11 [async_llm.py:261] Added request cmpl-76f9d9ee5a4244859d73909d6a9c7145-0. INFO 03-01 19:07:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:07:12 [logger.py:42] Received request cmpl-5b264109cdaa482089b3406f33c7c645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:12 [async_llm.py:261] Added request cmpl-5b264109cdaa482089b3406f33c7c645-0. INFO 03-01 19:07:13 [logger.py:42] Received request cmpl-35ec1b9d9e924a44b9364edfb2917e60-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:13 [async_llm.py:261] Added request cmpl-35ec1b9d9e924a44b9364edfb2917e60-0. INFO 03-01 19:07:14 [logger.py:42] Received request cmpl-312ff9a24e3a47e2bc0a6123573cc97d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:14 [async_llm.py:261] Added request cmpl-312ff9a24e3a47e2bc0a6123573cc97d-0. INFO 03-01 19:07:15 [logger.py:42] Received request cmpl-2e70c3bca9934887a8b59c43909065a2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:15 [async_llm.py:261] Added request cmpl-2e70c3bca9934887a8b59c43909065a2-0. INFO 03-01 19:07:16 [logger.py:42] Received request cmpl-9ba5cace58c8465da58e76c9a13a9087-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:16 [async_llm.py:261] Added request cmpl-9ba5cace58c8465da58e76c9a13a9087-0. INFO 03-01 19:07:18 [logger.py:42] Received request cmpl-778e842f5855471394f71774ad788f10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:18 [async_llm.py:261] Added request cmpl-778e842f5855471394f71774ad788f10-0. INFO 03-01 19:07:19 [logger.py:42] Received request cmpl-2cc66b375cf34ed1a7124df27b75a92a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:19 [async_llm.py:261] Added request cmpl-2cc66b375cf34ed1a7124df27b75a92a-0. INFO 03-01 19:07:20 [logger.py:42] Received request cmpl-a3cbe4faf3a84a1e958cd7ec1037a6f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:20 [async_llm.py:261] Added request cmpl-a3cbe4faf3a84a1e958cd7ec1037a6f1-0. INFO 03-01 19:07:21 [logger.py:42] Received request cmpl-9a31846e9bb048e8aa5e0d82ce1b8298-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:21 [async_llm.py:261] Added request cmpl-9a31846e9bb048e8aa5e0d82ce1b8298-0. INFO 03-01 19:07:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:07:22 [logger.py:42] Received request cmpl-75d2bb17280746dcadb435e42da25a7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:22 [async_llm.py:261] Added request cmpl-75d2bb17280746dcadb435e42da25a7c-0. INFO 03-01 19:07:23 [logger.py:42] Received request cmpl-f7e40fddf5ef4b96b8f33367f1bc2f77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:23 [async_llm.py:261] Added request cmpl-f7e40fddf5ef4b96b8f33367f1bc2f77-0. INFO 03-01 19:07:25 [logger.py:42] Received request cmpl-d7be198ec3a24d36b4c35166bfb705a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:25 [async_llm.py:261] Added request cmpl-d7be198ec3a24d36b4c35166bfb705a3-0. INFO 03-01 19:07:26 [logger.py:42] Received request cmpl-b61e064e3cc7401e8530c69f21d0b80d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:26 [async_llm.py:261] Added request cmpl-b61e064e3cc7401e8530c69f21d0b80d-0. INFO 03-01 19:07:27 [logger.py:42] Received request cmpl-31cebe5f2604459ea040c049383e9b6d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:27 [async_llm.py:261] Added request cmpl-31cebe5f2604459ea040c049383e9b6d-0. INFO 03-01 19:07:28 [logger.py:42] Received request cmpl-f1e9417fd28245ebb7fea23216656bd9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:28 [async_llm.py:261] Added request cmpl-f1e9417fd28245ebb7fea23216656bd9-0. INFO 03-01 19:07:29 [logger.py:42] Received request cmpl-2524580bb87041b9b94272e14b734e0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:29 [async_llm.py:261] Added request cmpl-2524580bb87041b9b94272e14b734e0c-0. INFO 03-01 19:07:30 [logger.py:42] Received request cmpl-e944f9ce082f4ea6b26950174f135138-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:30 [async_llm.py:261] Added request cmpl-e944f9ce082f4ea6b26950174f135138-0. INFO 03-01 19:07:32 [logger.py:42] Received request cmpl-1fa50a8bf23c4c8197e664350a598257-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:32 [async_llm.py:261] Added request cmpl-1fa50a8bf23c4c8197e664350a598257-0. INFO 03-01 19:07:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:07:33 [logger.py:42] Received request cmpl-308a2627d0bb46609bd6db21c0296501-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:33 [async_llm.py:261] Added request cmpl-308a2627d0bb46609bd6db21c0296501-0. INFO 03-01 19:07:34 [logger.py:42] Received request cmpl-70dc2fe9ab5048e6a0cdbd06cbbb394b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:34 [async_llm.py:261] Added request cmpl-70dc2fe9ab5048e6a0cdbd06cbbb394b-0. INFO 03-01 19:07:35 [logger.py:42] Received request cmpl-a5dbe292a6124670aefad52fb165c290-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:35 [async_llm.py:261] Added request cmpl-a5dbe292a6124670aefad52fb165c290-0. INFO 03-01 19:07:36 [logger.py:42] Received request cmpl-89a097ecd4634f13b12bd8cbb671dc3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:36 [async_llm.py:261] Added request cmpl-89a097ecd4634f13b12bd8cbb671dc3b-0. INFO 03-01 19:07:37 [logger.py:42] Received request cmpl-55c24a3e3ac84c15a8f786256e453749-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:37 [async_llm.py:261] Added request cmpl-55c24a3e3ac84c15a8f786256e453749-0. INFO 03-01 19:07:39 [logger.py:42] Received request cmpl-d84597084aed474db293d7b51a4fdb48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:39 [async_llm.py:261] Added request cmpl-d84597084aed474db293d7b51a4fdb48-0. INFO 03-01 19:07:40 [logger.py:42] Received request cmpl-63e2c41689534ba8b567a35e12fe86c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:40 [async_llm.py:261] Added request cmpl-63e2c41689534ba8b567a35e12fe86c2-0. INFO 03-01 19:07:41 [logger.py:42] Received request cmpl-b66ec1eae3a94912b0ca5a9187daec23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:41 [async_llm.py:261] Added request cmpl-b66ec1eae3a94912b0ca5a9187daec23-0. INFO 03-01 19:07:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:07:42 [logger.py:42] Received request cmpl-9cc26444eb4741a3ad61147dda0bef56-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:42 [async_llm.py:261] Added request cmpl-9cc26444eb4741a3ad61147dda0bef56-0. INFO 03-01 19:07:43 [logger.py:42] Received request cmpl-287507cacbcf41558e30b957c8455e99-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:43 [async_llm.py:261] Added request cmpl-287507cacbcf41558e30b957c8455e99-0. INFO 03-01 19:07:44 [logger.py:42] Received request cmpl-20f31fd67297478bb97eb40db76a5ada-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:44 [async_llm.py:261] Added request cmpl-20f31fd67297478bb97eb40db76a5ada-0. INFO 03-01 19:07:45 [logger.py:42] Received request cmpl-71d24389c5384b4ab54dec0b9e0a5b75-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:45 [async_llm.py:261] Added request cmpl-71d24389c5384b4ab54dec0b9e0a5b75-0. INFO 03-01 19:07:47 [logger.py:42] Received request cmpl-73781506f862431ca5d568637479fcf6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:47 [async_llm.py:261] Added request cmpl-73781506f862431ca5d568637479fcf6-0. INFO 03-01 19:07:48 [logger.py:42] Received request cmpl-7ac42fb344ed4420b98de96fdda742dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:48 [async_llm.py:261] Added request cmpl-7ac42fb344ed4420b98de96fdda742dc-0. INFO 03-01 19:07:49 [logger.py:42] Received request cmpl-9f7da62f774d47178c59e60e1ef57961-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:49 [async_llm.py:261] Added request cmpl-9f7da62f774d47178c59e60e1ef57961-0. INFO 03-01 19:07:50 [logger.py:42] Received request cmpl-274b42585ab14653a86576a384ead477-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:50 [async_llm.py:261] Added request cmpl-274b42585ab14653a86576a384ead477-0. INFO 03-01 19:07:51 [logger.py:42] Received request cmpl-bfc0312d9cab4ee1a8e2773c54e25913-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:51 [async_llm.py:261] Added request cmpl-bfc0312d9cab4ee1a8e2773c54e25913-0. INFO 03-01 19:07:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:07:52 [logger.py:42] Received request cmpl-5a8fa388658a4edc90160a74d827c7ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:52 [async_llm.py:261] Added request cmpl-5a8fa388658a4edc90160a74d827c7ff-0. INFO 03-01 19:07:54 [logger.py:42] Received request cmpl-b57da588521047a08deeba3c9bd62652-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:54 [async_llm.py:261] Added request cmpl-b57da588521047a08deeba3c9bd62652-0. INFO 03-01 19:07:55 [logger.py:42] Received request cmpl-7754f13246514f399f5a9161a8cd7f47-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:55 [async_llm.py:261] Added request cmpl-7754f13246514f399f5a9161a8cd7f47-0. INFO 03-01 19:07:56 [logger.py:42] Received request cmpl-dfbfa1ee5b7b400698bf1e13442ce1c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:56 [async_llm.py:261] Added request cmpl-dfbfa1ee5b7b400698bf1e13442ce1c2-0. INFO 03-01 19:07:57 [logger.py:42] Received request cmpl-1b0d6bd227b9442b958bdee6d6d91c9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:57 [async_llm.py:261] Added request cmpl-1b0d6bd227b9442b958bdee6d6d91c9e-0. INFO 03-01 19:07:58 [logger.py:42] Received request cmpl-9aa2dd58db8f4593b0f40676e9451f49-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:58 [async_llm.py:261] Added request cmpl-9aa2dd58db8f4593b0f40676e9451f49-0. INFO 03-01 19:07:59 [logger.py:42] Received request cmpl-0361b7154398498db3840c5f8534051a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:07:59 [async_llm.py:261] Added request cmpl-0361b7154398498db3840c5f8534051a-0. INFO 03-01 19:08:01 [logger.py:42] Received request cmpl-d516a6659b8e450b9c20ab0ebb9e5c38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:01 [async_llm.py:261] Added request cmpl-d516a6659b8e450b9c20ab0ebb9e5c38-0. INFO 03-01 19:08:02 [logger.py:42] Received request cmpl-5ddcdb3dacc0436288a95d7437c52077-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:02 [async_llm.py:261] Added request cmpl-5ddcdb3dacc0436288a95d7437c52077-0. INFO 03-01 19:08:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:08:03 [logger.py:42] Received request cmpl-fce22338acf642e78079f96b33f7c988-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:03 [async_llm.py:261] Added request cmpl-fce22338acf642e78079f96b33f7c988-0. INFO 03-01 19:08:04 [logger.py:42] Received request cmpl-43fecd48b84e4a2299c93ad299c69e4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:04 [async_llm.py:261] Added request cmpl-43fecd48b84e4a2299c93ad299c69e4f-0. INFO 03-01 19:08:05 [logger.py:42] Received request cmpl-84ddf2c22e4c4b7eb00a26a3ef45ee0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:05 [async_llm.py:261] Added request cmpl-84ddf2c22e4c4b7eb00a26a3ef45ee0e-0. INFO 03-01 19:08:06 [logger.py:42] Received request cmpl-d90ab5c8d1944dfb863d7db908ad945f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:06 [async_llm.py:261] Added request cmpl-d90ab5c8d1944dfb863d7db908ad945f-0. INFO 03-01 19:08:07 [logger.py:42] Received request cmpl-aceddb9fe5234def81e3b8d84d3f5dac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:07 [async_llm.py:261] Added request cmpl-aceddb9fe5234def81e3b8d84d3f5dac-0. INFO 03-01 19:08:09 [logger.py:42] Received request cmpl-d0a2c66ea3be4674ada61e19c482c3e2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:09 [async_llm.py:261] Added request cmpl-d0a2c66ea3be4674ada61e19c482c3e2-0. INFO 03-01 19:08:10 [logger.py:42] Received request cmpl-d2b1149c764e45ea9e78ddfaadf6b91c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:10 [async_llm.py:261] Added request cmpl-d2b1149c764e45ea9e78ddfaadf6b91c-0. INFO 03-01 19:08:11 [logger.py:42] Received request cmpl-fc45e36ad79348f299d3f8312d13a70e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:11 [async_llm.py:261] Added request cmpl-fc45e36ad79348f299d3f8312d13a70e-0. INFO 03-01 19:08:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:08:12 [logger.py:42] Received request cmpl-9269255dac87451796351c122b44b56d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:12 [async_llm.py:261] Added request cmpl-9269255dac87451796351c122b44b56d-0. INFO 03-01 19:08:13 [logger.py:42] Received request cmpl-ff8625e5bed44cb2b564e6dce6ae57d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:13 [async_llm.py:261] Added request cmpl-ff8625e5bed44cb2b564e6dce6ae57d3-0. INFO 03-01 19:08:14 [logger.py:42] Received request cmpl-4bf49707d6394889bd8f212ab8aede9e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:14 [async_llm.py:261] Added request cmpl-4bf49707d6394889bd8f212ab8aede9e-0. INFO 03-01 19:08:16 [logger.py:42] Received request cmpl-a97f757766f84651b18b8d3587825a4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:16 [async_llm.py:261] Added request cmpl-a97f757766f84651b18b8d3587825a4c-0. INFO 03-01 19:08:17 [logger.py:42] Received request cmpl-114af43d243b42bd90ef809eaf0452b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:17 [async_llm.py:261] Added request cmpl-114af43d243b42bd90ef809eaf0452b1-0. INFO 03-01 19:08:18 [logger.py:42] Received request cmpl-0f29d9b666a54244856c98de4def1135-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:18 [async_llm.py:261] Added request cmpl-0f29d9b666a54244856c98de4def1135-0. INFO 03-01 19:08:19 [logger.py:42] Received request cmpl-5fbd1e671d3845288792986be579b8ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:19 [async_llm.py:261] Added request cmpl-5fbd1e671d3845288792986be579b8ef-0. INFO 03-01 19:08:20 [logger.py:42] Received request cmpl-6f848f87d42f4957845aa53a5a6e543f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:20 [async_llm.py:261] Added request cmpl-6f848f87d42f4957845aa53a5a6e543f-0. INFO 03-01 19:08:21 [logger.py:42] Received request cmpl-ed703a32eaea4a48b3415d03bfb719ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:21 [async_llm.py:261] Added request cmpl-ed703a32eaea4a48b3415d03bfb719ff-0. INFO 03-01 19:08:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:08:23 [logger.py:42] Received request cmpl-f134e95f859d43d5bb5c626749e24841-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:23 [async_llm.py:261] Added request cmpl-f134e95f859d43d5bb5c626749e24841-0. INFO 03-01 19:08:24 [logger.py:42] Received request cmpl-c387a6e15d65455db902d433152fda71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:24 [async_llm.py:261] Added request cmpl-c387a6e15d65455db902d433152fda71-0. INFO 03-01 19:08:25 [logger.py:42] Received request cmpl-2816477944cb4f2cbfaa9a258b284003-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:25 [async_llm.py:261] Added request cmpl-2816477944cb4f2cbfaa9a258b284003-0. INFO 03-01 19:08:26 [logger.py:42] Received request cmpl-93d8912441614eeb98030b331fb4fa11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:26 [async_llm.py:261] Added request cmpl-93d8912441614eeb98030b331fb4fa11-0. INFO 03-01 19:08:27 [logger.py:42] Received request cmpl-bc95019a5d8c4367834e568f63611a3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:27 [async_llm.py:261] Added request cmpl-bc95019a5d8c4367834e568f63611a3d-0. INFO 03-01 19:08:28 [logger.py:42] Received request cmpl-f73352e2aa824df995c71fb7a587f56a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:28 [async_llm.py:261] Added request cmpl-f73352e2aa824df995c71fb7a587f56a-0. INFO 03-01 19:08:29 [logger.py:42] Received request cmpl-c2606310b7554300933cff62a876b2fb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:29 [async_llm.py:261] Added request cmpl-c2606310b7554300933cff62a876b2fb-0. INFO 03-01 19:08:31 [logger.py:42] Received request cmpl-7b313714f8d749d3bfe0474e02107cb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:31 [async_llm.py:261] Added request cmpl-7b313714f8d749d3bfe0474e02107cb3-0. INFO 03-01 19:08:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:08:32 [logger.py:42] Received request cmpl-ab2cc5fd6fbc4774b4941f710e85fbe6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:32 [async_llm.py:261] Added request cmpl-ab2cc5fd6fbc4774b4941f710e85fbe6-0. INFO 03-01 19:08:33 [logger.py:42] Received request cmpl-3f04967e21a74b9c9895c511f161583f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:33 [async_llm.py:261] Added request cmpl-3f04967e21a74b9c9895c511f161583f-0. INFO 03-01 19:08:34 [logger.py:42] Received request cmpl-666bc6506d944fab9bb0e18a0996086e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:34 [async_llm.py:261] Added request cmpl-666bc6506d944fab9bb0e18a0996086e-0. INFO 03-01 19:08:35 [logger.py:42] Received request cmpl-ddcd666748b24197af70918bd6e3a8f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:35 [async_llm.py:261] Added request cmpl-ddcd666748b24197af70918bd6e3a8f8-0. INFO 03-01 19:08:36 [logger.py:42] Received request cmpl-b2747a7b4006446088d2a7b9dabff3f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:36 [async_llm.py:261] Added request cmpl-b2747a7b4006446088d2a7b9dabff3f5-0. INFO 03-01 19:08:38 [logger.py:42] Received request cmpl-3f87b4901a7742dd9ddd906793f1bb25-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:38 [async_llm.py:261] Added request cmpl-3f87b4901a7742dd9ddd906793f1bb25-0. INFO 03-01 19:08:39 [logger.py:42] Received request cmpl-4d2733536ba449f0b62f07faaf910dd2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:39 [async_llm.py:261] Added request cmpl-4d2733536ba449f0b62f07faaf910dd2-0. INFO 03-01 19:08:40 [logger.py:42] Received request cmpl-4404c206e97a48b9bd6250bfef68d51d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:40 [async_llm.py:261] Added request cmpl-4404c206e97a48b9bd6250bfef68d51d-0. INFO 03-01 19:08:41 [logger.py:42] Received request cmpl-b4cdf504a00643748012095cc156b051-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:41 [async_llm.py:261] Added request cmpl-b4cdf504a00643748012095cc156b051-0. INFO 03-01 19:08:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:08:42 [logger.py:42] Received request cmpl-8bef0dacda5e40af9f739ba680039a76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:42 [async_llm.py:261] Added request cmpl-8bef0dacda5e40af9f739ba680039a76-0. INFO 03-01 19:08:44 [logger.py:42] Received request cmpl-0182ead7ccf940b3b6ed40d7cd243234-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:44 [async_llm.py:261] Added request cmpl-0182ead7ccf940b3b6ed40d7cd243234-0. INFO 03-01 19:08:45 [logger.py:42] Received request cmpl-d7053179ea4440a7a5b871f8ff0c772c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:45 [async_llm.py:261] Added request cmpl-d7053179ea4440a7a5b871f8ff0c772c-0. INFO 03-01 19:08:46 [logger.py:42] Received request cmpl-05c6c7e588eb42b4ac752055d131d4a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:46 [async_llm.py:261] Added request cmpl-05c6c7e588eb42b4ac752055d131d4a4-0. INFO 03-01 19:08:47 [logger.py:42] Received request cmpl-f1e33c7a22014a18bb2276d5d123862a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:47 [async_llm.py:261] Added request cmpl-f1e33c7a22014a18bb2276d5d123862a-0. INFO 03-01 19:08:48 [logger.py:42] Received request cmpl-7b88933b44e34458b3fc664231ce67ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:48 [async_llm.py:261] Added request cmpl-7b88933b44e34458b3fc664231ce67ce-0. INFO 03-01 19:08:49 [logger.py:42] Received request cmpl-67ecf374c24e49068ee576362bcaccee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:49 [async_llm.py:261] Added request cmpl-67ecf374c24e49068ee576362bcaccee-0. INFO 03-01 19:08:51 [logger.py:42] Received request cmpl-26f677aa78fe44ba8d1baed73a5ffb11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:51 [async_llm.py:261] Added request cmpl-26f677aa78fe44ba8d1baed73a5ffb11-0. INFO 03-01 19:08:52 [logger.py:42] Received request cmpl-90e4222201d94fb38e46d0e614d7763e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:52 [async_llm.py:261] Added request cmpl-90e4222201d94fb38e46d0e614d7763e-0. INFO 03-01 19:08:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:08:53 [logger.py:42] Received request cmpl-dabb8d139f7148258db2b5cdaf0c69f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:53 [async_llm.py:261] Added request cmpl-dabb8d139f7148258db2b5cdaf0c69f1-0. INFO 03-01 19:08:54 [logger.py:42] Received request cmpl-5b4436f13b8343a3a09504a3074e256c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:54 [async_llm.py:261] Added request cmpl-5b4436f13b8343a3a09504a3074e256c-0. INFO 03-01 19:08:55 [logger.py:42] Received request cmpl-a1a447149f164a52ab742b279567f7b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:55 [async_llm.py:261] Added request cmpl-a1a447149f164a52ab742b279567f7b8-0. INFO 03-01 19:08:56 [logger.py:42] Received request cmpl-d92e729b1673447ab47e6c3930fee6a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:56 [async_llm.py:261] Added request cmpl-d92e729b1673447ab47e6c3930fee6a8-0. INFO 03-01 19:08:58 [logger.py:42] Received request cmpl-11af10b0c80347bd8a96a41a814e5987-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:58 [async_llm.py:261] Added request cmpl-11af10b0c80347bd8a96a41a814e5987-0. INFO 03-01 19:08:59 [logger.py:42] Received request cmpl-99e2875f80ba46c9bc54d7fa46dd1e63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:08:59 [async_llm.py:261] Added request cmpl-99e2875f80ba46c9bc54d7fa46dd1e63-0. INFO 03-01 19:09:00 [logger.py:42] Received request cmpl-dc6f9a2330a14aea9e7dd2d66c4ab68c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:00 [async_llm.py:261] Added request cmpl-dc6f9a2330a14aea9e7dd2d66c4ab68c-0. INFO 03-01 19:09:01 [logger.py:42] Received request cmpl-c9d7e7a873b04415a16e6cb5e92180b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:01 [async_llm.py:261] Added request cmpl-c9d7e7a873b04415a16e6cb5e92180b6-0. INFO 03-01 19:09:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:09:02 [logger.py:42] Received request cmpl-0f6c1c3802e34aadadfb2a458c295d16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:02 [async_llm.py:261] Added request cmpl-0f6c1c3802e34aadadfb2a458c295d16-0. INFO 03-01 19:09:03 [logger.py:42] Received request cmpl-9184751238ba4fd1b4fc7ebacb0cb565-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:03 [async_llm.py:261] Added request cmpl-9184751238ba4fd1b4fc7ebacb0cb565-0. INFO 03-01 19:09:04 [logger.py:42] Received request cmpl-b88a63f025e94328949c23835faf044e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:04 [async_llm.py:261] Added request cmpl-b88a63f025e94328949c23835faf044e-0. INFO 03-01 19:09:06 [logger.py:42] Received request cmpl-6d4b76d975734ef1b52ce2fcb6e4fb6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:06 [async_llm.py:261] Added request cmpl-6d4b76d975734ef1b52ce2fcb6e4fb6b-0. INFO 03-01 19:09:07 [logger.py:42] Received request cmpl-61d71b71502c4746bb7dfd4a1787b0b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:07 [async_llm.py:261] Added request cmpl-61d71b71502c4746bb7dfd4a1787b0b7-0. INFO 03-01 19:09:08 [logger.py:42] Received request cmpl-7316db59c2464f33be046d06df342215-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:08 [async_llm.py:261] Added request cmpl-7316db59c2464f33be046d06df342215-0. INFO 03-01 19:09:09 [logger.py:42] Received request cmpl-b4f69484b3cc463fa35d24fac27ad6a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:09 [async_llm.py:261] Added request cmpl-b4f69484b3cc463fa35d24fac27ad6a3-0. INFO 03-01 19:09:10 [logger.py:42] Received request cmpl-631ac7c6eb0849ff97ba20a893c98922-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:10 [async_llm.py:261] Added request cmpl-631ac7c6eb0849ff97ba20a893c98922-0. INFO 03-01 19:09:11 [logger.py:42] Received request cmpl-2a342a9d8ebc4776b08ff52e4e0a108f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:11 [async_llm.py:261] Added request cmpl-2a342a9d8ebc4776b08ff52e4e0a108f-0. INFO 03-01 19:09:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:09:13 [logger.py:42] Received request cmpl-211467d8046f48b8a174b76e05fba067-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:13 [async_llm.py:261] Added request cmpl-211467d8046f48b8a174b76e05fba067-0. INFO 03-01 19:09:14 [logger.py:42] Received request cmpl-320754af12f44694a5a877766a4e2e1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:14 [async_llm.py:261] Added request cmpl-320754af12f44694a5a877766a4e2e1e-0. INFO 03-01 19:09:15 [logger.py:42] Received request cmpl-b52cc37c881244fe98bb1877a0189ff8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:15 [async_llm.py:261] Added request cmpl-b52cc37c881244fe98bb1877a0189ff8-0. INFO 03-01 19:09:16 [logger.py:42] Received request cmpl-b553a5bbe888433fb6810c758da8032c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:16 [async_llm.py:261] Added request cmpl-b553a5bbe888433fb6810c758da8032c-0. INFO 03-01 19:09:17 [logger.py:42] Received request cmpl-51ce272d132d4c00aecf913c57d775df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:17 [async_llm.py:261] Added request cmpl-51ce272d132d4c00aecf913c57d775df-0. INFO 03-01 19:09:18 [logger.py:42] Received request cmpl-afcbd30333214856874d9a88218d66a6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:18 [async_llm.py:261] Added request cmpl-afcbd30333214856874d9a88218d66a6-0. INFO 03-01 19:09:20 [logger.py:42] Received request cmpl-3a2da287a2fd4dd1a38a137ad23db098-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:20 [async_llm.py:261] Added request cmpl-3a2da287a2fd4dd1a38a137ad23db098-0. INFO 03-01 19:09:21 [logger.py:42] Received request cmpl-276735c74fa14c20ad3d369ca9c996a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:21 [async_llm.py:261] Added request cmpl-276735c74fa14c20ad3d369ca9c996a9-0. INFO 03-01 19:09:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:09:22 [logger.py:42] Received request cmpl-8a952d2d2abb4cdf88c67770a008dc41-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:22 [async_llm.py:261] Added request cmpl-8a952d2d2abb4cdf88c67770a008dc41-0. INFO 03-01 19:09:23 [logger.py:42] Received request cmpl-9645fafa87894ce9998d04796db28c30-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:23 [async_llm.py:261] Added request cmpl-9645fafa87894ce9998d04796db28c30-0. INFO 03-01 19:09:24 [logger.py:42] Received request cmpl-7b5fe3878ac24b5f8383107cf53057df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:24 [async_llm.py:261] Added request cmpl-7b5fe3878ac24b5f8383107cf53057df-0. INFO 03-01 19:09:25 [logger.py:42] Received request cmpl-94cc0fb5e0aa4c6f8885be6e6acd81dc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:25 [async_llm.py:261] Added request cmpl-94cc0fb5e0aa4c6f8885be6e6acd81dc-0. INFO 03-01 19:09:26 [logger.py:42] Received request cmpl-cefd5f2ed53340f98bd19082ce2dd8d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:26 [async_llm.py:261] Added request cmpl-cefd5f2ed53340f98bd19082ce2dd8d0-0. INFO 03-01 19:09:28 [logger.py:42] Received request cmpl-b11e1b8bec704f299105c7cbfa093489-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:28 [async_llm.py:261] Added request cmpl-b11e1b8bec704f299105c7cbfa093489-0. INFO 03-01 19:09:29 [logger.py:42] Received request cmpl-235feedaabdc4492863eade85e04777e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:29 [async_llm.py:261] Added request cmpl-235feedaabdc4492863eade85e04777e-0. INFO 03-01 19:09:30 [logger.py:42] Received request cmpl-e1869e3bb16642568b1b7d0b9ee0e48f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:30 [async_llm.py:261] Added request cmpl-e1869e3bb16642568b1b7d0b9ee0e48f-0. INFO 03-01 19:09:31 [logger.py:42] Received request cmpl-0a003780b46d4dabb555c0a26508e15d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:31 [async_llm.py:261] Added request cmpl-0a003780b46d4dabb555c0a26508e15d-0. INFO 03-01 19:09:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:09:32 [logger.py:42] Received request cmpl-7822623385634278b1301a5e127b7ff4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:32 [async_llm.py:261] Added request cmpl-7822623385634278b1301a5e127b7ff4-0. INFO 03-01 19:09:33 [logger.py:42] Received request cmpl-8ea3801fb4f640ec944979c30d9b7898-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:33 [async_llm.py:261] Added request cmpl-8ea3801fb4f640ec944979c30d9b7898-0. INFO 03-01 19:09:35 [logger.py:42] Received request cmpl-ba3f7acc83fe4ca5b370646172698f84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:35 [async_llm.py:261] Added request cmpl-ba3f7acc83fe4ca5b370646172698f84-0. INFO 03-01 19:09:36 [logger.py:42] Received request cmpl-af0b6f210c3e45f7a394ba382829948a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:36 [async_llm.py:261] Added request cmpl-af0b6f210c3e45f7a394ba382829948a-0. INFO 03-01 19:09:37 [logger.py:42] Received request cmpl-bf8c6f9c42b54514896358a6f8fc1125-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:37 [async_llm.py:261] Added request cmpl-bf8c6f9c42b54514896358a6f8fc1125-0. INFO 03-01 19:09:38 [logger.py:42] Received request cmpl-84240262ff2a4b3a82b8dc5556bcfbd8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:38 [async_llm.py:261] Added request cmpl-84240262ff2a4b3a82b8dc5556bcfbd8-0. INFO 03-01 19:09:39 [logger.py:42] Received request cmpl-5fe82997b08840d9b25849ede9fd6293-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:39 [async_llm.py:261] Added request cmpl-5fe82997b08840d9b25849ede9fd6293-0. INFO 03-01 19:09:40 [logger.py:42] Received request cmpl-a15484a008234befa5b7fb144724d74d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:40 [async_llm.py:261] Added request cmpl-a15484a008234befa5b7fb144724d74d-0. INFO 03-01 19:09:42 [logger.py:42] Received request cmpl-07586c1aca3c47df8fbbc38e33c1ca64-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:42 [async_llm.py:261] Added request cmpl-07586c1aca3c47df8fbbc38e33c1ca64-0. INFO 03-01 19:09:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:09:43 [logger.py:42] Received request cmpl-909c25df71c64b099157f5e8a5ef5563-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:43 [async_llm.py:261] Added request cmpl-909c25df71c64b099157f5e8a5ef5563-0. INFO 03-01 19:09:44 [logger.py:42] Received request cmpl-10c07c6350424ef2a12650221f88cc63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:44 [async_llm.py:261] Added request cmpl-10c07c6350424ef2a12650221f88cc63-0. INFO 03-01 19:09:45 [logger.py:42] Received request cmpl-83e486373bc14d4ba039aa1b188c1e77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:45 [async_llm.py:261] Added request cmpl-83e486373bc14d4ba039aa1b188c1e77-0. INFO 03-01 19:09:46 [logger.py:42] Received request cmpl-83b7b31ece4144edb2c87dded3f56572-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:46 [async_llm.py:261] Added request cmpl-83b7b31ece4144edb2c87dded3f56572-0. INFO 03-01 19:09:47 [logger.py:42] Received request cmpl-8c209911d12543a59f5fc652082d94df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:47 [async_llm.py:261] Added request cmpl-8c209911d12543a59f5fc652082d94df-0. INFO 03-01 19:09:49 [logger.py:42] Received request cmpl-aba957de0c664d9dbe13cde2435b437f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:49 [async_llm.py:261] Added request cmpl-aba957de0c664d9dbe13cde2435b437f-0. INFO 03-01 19:09:50 [logger.py:42] Received request cmpl-582779c87951424299d85ee2347bf04f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:50 [async_llm.py:261] Added request cmpl-582779c87951424299d85ee2347bf04f-0. INFO 03-01 19:09:51 [logger.py:42] Received request cmpl-cd12cc1fb8a04382bfa63bd22bd5201a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:51 [async_llm.py:261] Added request cmpl-cd12cc1fb8a04382bfa63bd22bd5201a-0. INFO 03-01 19:09:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:09:52 [logger.py:42] Received request cmpl-1fb7ffcf25aa4dc099576865e02b47c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:52 [async_llm.py:261] Added request cmpl-1fb7ffcf25aa4dc099576865e02b47c1-0. INFO 03-01 19:09:53 [logger.py:42] Received request cmpl-5c3c2ba2a094461dab30da30d7ff9daf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:53 [async_llm.py:261] Added request cmpl-5c3c2ba2a094461dab30da30d7ff9daf-0. INFO 03-01 19:09:54 [logger.py:42] Received request cmpl-69cc1149c5c04cba961705fc4b629dca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:54 [async_llm.py:261] Added request cmpl-69cc1149c5c04cba961705fc4b629dca-0. INFO 03-01 19:09:56 [logger.py:42] Received request cmpl-31a78b37efd94699b7ffbfabd210dca3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:56 [async_llm.py:261] Added request cmpl-31a78b37efd94699b7ffbfabd210dca3-0. INFO 03-01 19:09:57 [logger.py:42] Received request cmpl-d32b5e9b327d4cc39206b354fb2681bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:57 [async_llm.py:261] Added request cmpl-d32b5e9b327d4cc39206b354fb2681bf-0. INFO 03-01 19:09:58 [logger.py:42] Received request cmpl-339a7d9b2bb84e1e89325179eb2060f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:58 [async_llm.py:261] Added request cmpl-339a7d9b2bb84e1e89325179eb2060f4-0. INFO 03-01 19:09:59 [logger.py:42] Received request cmpl-91e16b705d4e4365b31061bae351a860-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:09:59 [async_llm.py:261] Added request cmpl-91e16b705d4e4365b31061bae351a860-0. INFO 03-01 19:10:00 [logger.py:42] Received request cmpl-843540dab7da46b3afe458ec4c5b47f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:00 [async_llm.py:261] Added request cmpl-843540dab7da46b3afe458ec4c5b47f8-0. INFO 03-01 19:10:01 [logger.py:42] Received request cmpl-b064ea86cd3f419fbbac2f47036d5748-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:01 [async_llm.py:261] Added request cmpl-b064ea86cd3f419fbbac2f47036d5748-0. INFO 03-01 19:10:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:10:03 [logger.py:42] Received request cmpl-bdfc14573fa043589d1b06ca8e25ed1b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:03 [async_llm.py:261] Added request cmpl-bdfc14573fa043589d1b06ca8e25ed1b-0. INFO 03-01 19:10:04 [logger.py:42] Received request cmpl-060d6d55113940438f3b23397298cc0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:04 [async_llm.py:261] Added request cmpl-060d6d55113940438f3b23397298cc0c-0. INFO 03-01 19:10:05 [logger.py:42] Received request cmpl-c4291e796ca94229a5ff60da180c66a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:05 [async_llm.py:261] Added request cmpl-c4291e796ca94229a5ff60da180c66a0-0. INFO 03-01 19:10:06 [logger.py:42] Received request cmpl-df29ea0838244b8fb1911369d3d00f12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:06 [async_llm.py:261] Added request cmpl-df29ea0838244b8fb1911369d3d00f12-0. INFO 03-01 19:10:07 [logger.py:42] Received request cmpl-3726b08f1418474b9ad08210653a87fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:07 [async_llm.py:261] Added request cmpl-3726b08f1418474b9ad08210653a87fd-0. INFO 03-01 19:10:08 [logger.py:42] Received request cmpl-f1f3ac0fbbf3419a86bdef39b0e6e4d7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:08 [async_llm.py:261] Added request cmpl-f1f3ac0fbbf3419a86bdef39b0e6e4d7-0. INFO 03-01 19:10:10 [logger.py:42] Received request cmpl-9203f8d90598408ba3e9130db7a96fb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:10 [async_llm.py:261] Added request cmpl-9203f8d90598408ba3e9130db7a96fb7-0. INFO 03-01 19:10:11 [logger.py:42] Received request cmpl-0108d2d958d14f47a54b65eb92ce1a50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:11 [async_llm.py:261] Added request cmpl-0108d2d958d14f47a54b65eb92ce1a50-0. INFO 03-01 19:10:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:10:12 [logger.py:42] Received request cmpl-f3d005cc519c4f17977e2f75d730401f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:12 [async_llm.py:261] Added request cmpl-f3d005cc519c4f17977e2f75d730401f-0. INFO 03-01 19:10:13 [logger.py:42] Received request cmpl-73676b7d4b23430b8062acf3fc4f755f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:13 [async_llm.py:261] Added request cmpl-73676b7d4b23430b8062acf3fc4f755f-0. INFO 03-01 19:10:14 [logger.py:42] Received request cmpl-4f655cef583c48099e3a838a97c7abdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:14 [async_llm.py:261] Added request cmpl-4f655cef583c48099e3a838a97c7abdc-0. INFO 03-01 19:10:15 [logger.py:42] Received request cmpl-3c8c057472c54b02b665b239293b6592-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:15 [async_llm.py:261] Added request cmpl-3c8c057472c54b02b665b239293b6592-0. INFO 03-01 19:10:17 [logger.py:42] Received request cmpl-683342e609df49928afbdf3d1a27c98d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:17 [async_llm.py:261] Added request cmpl-683342e609df49928afbdf3d1a27c98d-0. INFO 03-01 19:10:18 [logger.py:42] Received request cmpl-10e33d78db014eefaf8b7ad15da0e56c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:18 [async_llm.py:261] Added request cmpl-10e33d78db014eefaf8b7ad15da0e56c-0. INFO 03-01 19:10:19 [logger.py:42] Received request cmpl-806ad28586244fd1833a57f9e66f8c76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:19 [async_llm.py:261] Added request cmpl-806ad28586244fd1833a57f9e66f8c76-0. INFO 03-01 19:10:20 [logger.py:42] Received request cmpl-ec21508a997d44f29535241fb78a9777-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:20 [async_llm.py:261] Added request cmpl-ec21508a997d44f29535241fb78a9777-0. INFO 03-01 19:10:21 [logger.py:42] Received request cmpl-9aa507087eec4a448d6c0a3af6d5032f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:21 [async_llm.py:261] Added request cmpl-9aa507087eec4a448d6c0a3af6d5032f-0. INFO 03-01 19:10:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:10:22 [logger.py:42] Received request cmpl-3c488ec4ea4f4f12897b1441147ee914-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:22 [async_llm.py:261] Added request cmpl-3c488ec4ea4f4f12897b1441147ee914-0. INFO 03-01 19:10:23 [logger.py:42] Received request cmpl-d771aeacc7214075bd6b6492d5992b5b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:23 [async_llm.py:261] Added request cmpl-d771aeacc7214075bd6b6492d5992b5b-0. INFO 03-01 19:10:25 [logger.py:42] Received request cmpl-205587c429b341a28854121fe4e9e839-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:25 [async_llm.py:261] Added request cmpl-205587c429b341a28854121fe4e9e839-0. INFO 03-01 19:10:26 [logger.py:42] Received request cmpl-fd5011ffc793434b8fce053b022c4c84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:26 [async_llm.py:261] Added request cmpl-fd5011ffc793434b8fce053b022c4c84-0. INFO 03-01 19:10:27 [logger.py:42] Received request cmpl-a30248a2c7394dfc92967fd6c0908dc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:27 [async_llm.py:261] Added request cmpl-a30248a2c7394dfc92967fd6c0908dc7-0. INFO 03-01 19:10:28 [logger.py:42] Received request cmpl-d4acdf7ea1f74e32bd452f9528385bde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:28 [async_llm.py:261] Added request cmpl-d4acdf7ea1f74e32bd452f9528385bde-0. INFO 03-01 19:10:29 [logger.py:42] Received request cmpl-e314f7738dfa480e95278b7641d2e225-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:29 [async_llm.py:261] Added request cmpl-e314f7738dfa480e95278b7641d2e225-0. INFO 03-01 19:10:30 [logger.py:42] Received request cmpl-553775d1ca144e9c9dd646ee0759ac15-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:30 [async_llm.py:261] Added request cmpl-553775d1ca144e9c9dd646ee0759ac15-0. INFO 03-01 19:10:32 [logger.py:42] Received request cmpl-8a12f3632ca84415b3bff7479ab3a799-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:32 [async_llm.py:261] Added request cmpl-8a12f3632ca84415b3bff7479ab3a799-0. INFO 03-01 19:10:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:10:33 [logger.py:42] Received request cmpl-bb86e330062a442ab674d619c0a776ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:33 [async_llm.py:261] Added request cmpl-bb86e330062a442ab674d619c0a776ef-0. INFO 03-01 19:10:34 [logger.py:42] Received request cmpl-5417c4766d944d7384a8d0b7b5ea3ef4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:34 [async_llm.py:261] Added request cmpl-5417c4766d944d7384a8d0b7b5ea3ef4-0. INFO 03-01 19:10:35 [logger.py:42] Received request cmpl-3ebe81df90c74c7f8a8e771e7b78fd10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:35 [async_llm.py:261] Added request cmpl-3ebe81df90c74c7f8a8e771e7b78fd10-0. INFO 03-01 19:10:36 [logger.py:42] Received request cmpl-43ba6164fab548fea116a40e82152b33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:36 [async_llm.py:261] Added request cmpl-43ba6164fab548fea116a40e82152b33-0. INFO 03-01 19:10:37 [logger.py:42] Received request cmpl-6922a3d8e1464f7f9871cfe76a760af6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:37 [async_llm.py:261] Added request cmpl-6922a3d8e1464f7f9871cfe76a760af6-0. INFO 03-01 19:10:39 [logger.py:42] Received request cmpl-c73613b14feb475896468f2ad86f2d36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:39 [async_llm.py:261] Added request cmpl-c73613b14feb475896468f2ad86f2d36-0. INFO 03-01 19:10:40 [logger.py:42] Received request cmpl-07e9f1a48f674fdc84f9970b08a12c14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:40 [async_llm.py:261] Added request cmpl-07e9f1a48f674fdc84f9970b08a12c14-0. INFO 03-01 19:10:41 [logger.py:42] Received request cmpl-47460654267f4889b246266c441cd722-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:41 [async_llm.py:261] Added request cmpl-47460654267f4889b246266c441cd722-0. INFO 03-01 19:10:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:10:42 [logger.py:42] Received request cmpl-3befcf6e207b4a2bbeb5b239c479e712-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:42 [async_llm.py:261] Added request cmpl-3befcf6e207b4a2bbeb5b239c479e712-0. INFO 03-01 19:10:43 [logger.py:42] Received request cmpl-c1cabd1f728f48ef9f27e068c31f0bec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:43 [async_llm.py:261] Added request cmpl-c1cabd1f728f48ef9f27e068c31f0bec-0. INFO 03-01 19:10:44 [logger.py:42] Received request cmpl-dfe0217141194bdb906456d69dc3a919-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:44 [async_llm.py:261] Added request cmpl-dfe0217141194bdb906456d69dc3a919-0. INFO 03-01 19:10:46 [logger.py:42] Received request cmpl-78e0a948cd904f3998e04ce9d597189e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:46 [async_llm.py:261] Added request cmpl-78e0a948cd904f3998e04ce9d597189e-0. INFO 03-01 19:10:47 [logger.py:42] Received request cmpl-fe39a9f0fff74efda36af61e1c7debf8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:47 [async_llm.py:261] Added request cmpl-fe39a9f0fff74efda36af61e1c7debf8-0. INFO 03-01 19:10:48 [logger.py:42] Received request cmpl-55c434077cde4785a51ba51f22ab3349-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:48 [async_llm.py:261] Added request cmpl-55c434077cde4785a51ba51f22ab3349-0. INFO 03-01 19:10:49 [logger.py:42] Received request cmpl-659426109a4a4370b878af1f4ae0b138-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:49 [async_llm.py:261] Added request cmpl-659426109a4a4370b878af1f4ae0b138-0. INFO 03-01 19:10:50 [logger.py:42] Received request cmpl-a8c5d956546b4cd9bc76c6159f5e351d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:50 [async_llm.py:261] Added request cmpl-a8c5d956546b4cd9bc76c6159f5e351d-0. INFO 03-01 19:10:51 [logger.py:42] Received request cmpl-276da60f907f47948bf3354b255039fe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:51 [async_llm.py:261] Added request cmpl-276da60f907f47948bf3354b255039fe-0. INFO 03-01 19:10:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:10:52 [logger.py:42] Received request cmpl-1c0697365cea467b9bef5ff11ecac4c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:52 [async_llm.py:261] Added request cmpl-1c0697365cea467b9bef5ff11ecac4c6-0. INFO 03-01 19:10:54 [logger.py:42] Received request cmpl-7a2bb1bf9f2247758197f166d1ac32cd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:54 [async_llm.py:261] Added request cmpl-7a2bb1bf9f2247758197f166d1ac32cd-0. INFO 03-01 19:10:55 [logger.py:42] Received request cmpl-7ce7fed94ac64da4b0328fa53a37e800-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:55 [async_llm.py:261] Added request cmpl-7ce7fed94ac64da4b0328fa53a37e800-0. INFO 03-01 19:10:56 [logger.py:42] Received request cmpl-93658bc3e4eb41ba8a207cd5cc67aa91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:56 [async_llm.py:261] Added request cmpl-93658bc3e4eb41ba8a207cd5cc67aa91-0. INFO 03-01 19:10:57 [logger.py:42] Received request cmpl-3f831d18cd53417a8c96344a842aff16-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:57 [async_llm.py:261] Added request cmpl-3f831d18cd53417a8c96344a842aff16-0. INFO 03-01 19:10:58 [logger.py:42] Received request cmpl-07a5477d12064ba79dded2943e358203-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:58 [async_llm.py:261] Added request cmpl-07a5477d12064ba79dded2943e358203-0. INFO 03-01 19:10:59 [logger.py:42] Received request cmpl-3c7c604adc324705a5f7da13e8eaccbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:10:59 [async_llm.py:261] Added request cmpl-3c7c604adc324705a5f7da13e8eaccbf-0. INFO 03-01 19:11:01 [logger.py:42] Received request cmpl-eae23262efe84babb8801333e7051958-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:01 [async_llm.py:261] Added request cmpl-eae23262efe84babb8801333e7051958-0. INFO 03-01 19:11:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:11:02 [logger.py:42] Received request cmpl-dfedf98b1854485bb148c92aff4193be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:02 [async_llm.py:261] Added request cmpl-dfedf98b1854485bb148c92aff4193be-0. INFO 03-01 19:11:03 [logger.py:42] Received request cmpl-d0b792f3210144b4bfc672f30bfa30aa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:03 [async_llm.py:261] Added request cmpl-d0b792f3210144b4bfc672f30bfa30aa-0. INFO 03-01 19:11:04 [logger.py:42] Received request cmpl-249d4d1a4e704824a29f72870b5e84d1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:04 [async_llm.py:261] Added request cmpl-249d4d1a4e704824a29f72870b5e84d1-0. INFO 03-01 19:11:05 [logger.py:42] Received request cmpl-3e83bf4ced3c44df812060148a347c34-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:05 [async_llm.py:261] Added request cmpl-3e83bf4ced3c44df812060148a347c34-0. INFO 03-01 19:11:06 [logger.py:42] Received request cmpl-cc6b99aabcb54bd4aabbfd2367a06d4d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:06 [async_llm.py:261] Added request cmpl-cc6b99aabcb54bd4aabbfd2367a06d4d-0. INFO 03-01 19:11:08 [logger.py:42] Received request cmpl-ea2a8a7dfef54e46ab538112e4e7669f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:08 [async_llm.py:261] Added request cmpl-ea2a8a7dfef54e46ab538112e4e7669f-0. INFO 03-01 19:11:09 [logger.py:42] Received request cmpl-a6156158250a46308e237b7935f9a0cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:09 [async_llm.py:261] Added request cmpl-a6156158250a46308e237b7935f9a0cb-0. INFO 03-01 19:11:10 [logger.py:42] Received request cmpl-5eca3e8704dc425ca309815a3ad39dd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:10 [async_llm.py:261] Added request cmpl-5eca3e8704dc425ca309815a3ad39dd7-0. INFO 03-01 19:11:11 [logger.py:42] Received request cmpl-cc56a6b41ad24adab4a3355ec94e3eb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:11 [async_llm.py:261] Added request cmpl-cc56a6b41ad24adab4a3355ec94e3eb7-0. INFO 03-01 19:11:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:11:12 [logger.py:42] Received request cmpl-81137b8e362246e58f8523a8b689f4e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:12 [async_llm.py:261] Added request cmpl-81137b8e362246e58f8523a8b689f4e4-0. INFO 03-01 19:11:13 [logger.py:42] Received request cmpl-1ad845dd6ff346419e9db55521fcfa4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:13 [async_llm.py:261] Added request cmpl-1ad845dd6ff346419e9db55521fcfa4c-0. INFO 03-01 19:11:14 [logger.py:42] Received request cmpl-c5c07a2bff074db4aef97e0a1a2bc868-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:14 [async_llm.py:261] Added request cmpl-c5c07a2bff074db4aef97e0a1a2bc868-0. INFO 03-01 19:11:16 [logger.py:42] Received request cmpl-638447db8fc34cb0a7e9fd60589bf700-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:16 [async_llm.py:261] Added request cmpl-638447db8fc34cb0a7e9fd60589bf700-0. INFO 03-01 19:11:17 [logger.py:42] Received request cmpl-ae156aff10224a6a83cc77ae4600872e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:17 [async_llm.py:261] Added request cmpl-ae156aff10224a6a83cc77ae4600872e-0. INFO 03-01 19:11:18 [logger.py:42] Received request cmpl-6b8c56861da849f38fbae5331c84fff8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:18 [async_llm.py:261] Added request cmpl-6b8c56861da849f38fbae5331c84fff8-0. INFO 03-01 19:11:19 [logger.py:42] Received request cmpl-4643d671114f4ad6b90a38059ec512de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:19 [async_llm.py:261] Added request cmpl-4643d671114f4ad6b90a38059ec512de-0. INFO 03-01 19:11:20 [logger.py:42] Received request cmpl-5459569c68e6484db88f8c7a3866f0ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:20 [async_llm.py:261] Added request cmpl-5459569c68e6484db88f8c7a3866f0ac-0. INFO 03-01 19:11:21 [logger.py:42] Received request cmpl-f3c9f1c83a6d432a9d61e5fd4cbb1d51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:21 [async_llm.py:261] Added request cmpl-f3c9f1c83a6d432a9d61e5fd4cbb1d51-0. INFO 03-01 19:11:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:11:23 [logger.py:42] Received request cmpl-0eda98cdbf144993a027d51803aec1cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:23 [async_llm.py:261] Added request cmpl-0eda98cdbf144993a027d51803aec1cf-0. INFO 03-01 19:11:24 [logger.py:42] Received request cmpl-cb677b611c314cc488fb6032c6019c43-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:24 [async_llm.py:261] Added request cmpl-cb677b611c314cc488fb6032c6019c43-0. INFO 03-01 19:11:25 [logger.py:42] Received request cmpl-23b8bc479c854704838b1704d0d1f0ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:25 [async_llm.py:261] Added request cmpl-23b8bc479c854704838b1704d0d1f0ea-0. INFO 03-01 19:11:26 [logger.py:42] Received request cmpl-f1a104af285540f78cc75044ca8280d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:26 [async_llm.py:261] Added request cmpl-f1a104af285540f78cc75044ca8280d8-0. INFO 03-01 19:11:27 [logger.py:42] Received request cmpl-c4610893c4d746afa146a5f657bd5531-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:27 [async_llm.py:261] Added request cmpl-c4610893c4d746afa146a5f657bd5531-0. INFO 03-01 19:11:28 [logger.py:42] Received request cmpl-91ced511b62d47a7a4373b1c52f3d007-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:28 [async_llm.py:261] Added request cmpl-91ced511b62d47a7a4373b1c52f3d007-0. INFO 03-01 19:11:30 [logger.py:42] Received request cmpl-61b0ebbdaada4283a52ed69d4281dfad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:30 [async_llm.py:261] Added request cmpl-61b0ebbdaada4283a52ed69d4281dfad-0. INFO 03-01 19:11:31 [logger.py:42] Received request cmpl-e9c748d4cc954ab6bc82018f483bbe94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:31 [async_llm.py:261] Added request cmpl-e9c748d4cc954ab6bc82018f483bbe94-0. INFO 03-01 19:11:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:11:32 [logger.py:42] Received request cmpl-d8b75f8130c8454c8943d795ff952b44-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:32 [async_llm.py:261] Added request cmpl-d8b75f8130c8454c8943d795ff952b44-0. INFO 03-01 19:11:33 [logger.py:42] Received request cmpl-da8ec02ef3ec4f9a87e0849ed8aa622e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:33 [async_llm.py:261] Added request cmpl-da8ec02ef3ec4f9a87e0849ed8aa622e-0. INFO 03-01 19:11:34 [logger.py:42] Received request cmpl-35881236cebf4a20ad95512c2e03e6a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:34 [async_llm.py:261] Added request cmpl-35881236cebf4a20ad95512c2e03e6a8-0. INFO 03-01 19:11:35 [logger.py:42] Received request cmpl-3e835f6f72854c44af43bf4afdc8b0b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:35 [async_llm.py:261] Added request cmpl-3e835f6f72854c44af43bf4afdc8b0b5-0. INFO 03-01 19:11:37 [logger.py:42] Received request cmpl-3f6eddcf90e4477fb3a50ddd6407f9bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:37 [async_llm.py:261] Added request cmpl-3f6eddcf90e4477fb3a50ddd6407f9bc-0. INFO 03-01 19:11:38 [logger.py:42] Received request cmpl-4f4200a8b5cc4b579f6a1191a69642d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:38 [async_llm.py:261] Added request cmpl-4f4200a8b5cc4b579f6a1191a69642d8-0. INFO 03-01 19:11:39 [logger.py:42] Received request cmpl-306afb8742034af58de0fd979fc77a27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:39 [async_llm.py:261] Added request cmpl-306afb8742034af58de0fd979fc77a27-0. INFO 03-01 19:11:40 [logger.py:42] Received request cmpl-e8e31a588c2e4048b264ed5c112a1de1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:40 [async_llm.py:261] Added request cmpl-e8e31a588c2e4048b264ed5c112a1de1-0. INFO 03-01 19:11:41 [logger.py:42] Received request cmpl-3c90436269744ccd962e4c259c016545-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:41 [async_llm.py:261] Added request cmpl-3c90436269744ccd962e4c259c016545-0. INFO 03-01 19:11:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:11:42 [logger.py:42] Received request cmpl-86e2b5b4be37465d9c95f51605631ac2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:42 [async_llm.py:261] Added request cmpl-86e2b5b4be37465d9c95f51605631ac2-0. INFO 03-01 19:11:43 [logger.py:42] Received request cmpl-80906d0b1ff04d52aeebc0e4e8373f62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:43 [async_llm.py:261] Added request cmpl-80906d0b1ff04d52aeebc0e4e8373f62-0. INFO 03-01 19:11:45 [logger.py:42] Received request cmpl-7192bb8a30d5496a8d3faece05f6e5ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:45 [async_llm.py:261] Added request cmpl-7192bb8a30d5496a8d3faece05f6e5ae-0. INFO 03-01 19:11:46 [logger.py:42] Received request cmpl-a668526f27b44339900ac83249664a18-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:46 [async_llm.py:261] Added request cmpl-a668526f27b44339900ac83249664a18-0. INFO 03-01 19:11:47 [logger.py:42] Received request cmpl-c8dda41c88d143a2912d1ceff8148b46-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:47 [async_llm.py:261] Added request cmpl-c8dda41c88d143a2912d1ceff8148b46-0. INFO 03-01 19:11:48 [logger.py:42] Received request cmpl-83c29cfe512c4c25a7cde3fb19173980-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:48 [async_llm.py:261] Added request cmpl-83c29cfe512c4c25a7cde3fb19173980-0. INFO 03-01 19:11:49 [logger.py:42] Received request cmpl-7b1bbed572a94076a1ebe033a3e57d9d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:49 [async_llm.py:261] Added request cmpl-7b1bbed572a94076a1ebe033a3e57d9d-0. INFO 03-01 19:11:50 [logger.py:42] Received request cmpl-2745d6d8e3c94fa6853628ada7349c71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:50 [async_llm.py:261] Added request cmpl-2745d6d8e3c94fa6853628ada7349c71-0. INFO 03-01 19:11:52 [logger.py:42] Received request cmpl-bc65d50d031e40299c099ae872c61840-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:52 [async_llm.py:261] Added request cmpl-bc65d50d031e40299c099ae872c61840-0. INFO 03-01 19:11:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:11:53 [logger.py:42] Received request cmpl-dd48b03ffd874b8e8ec5ae32e6e77ab4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:53 [async_llm.py:261] Added request cmpl-dd48b03ffd874b8e8ec5ae32e6e77ab4-0. INFO 03-01 19:11:54 [logger.py:42] Received request cmpl-7d51e467379543dd87dd480fbbbf3203-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:54 [async_llm.py:261] Added request cmpl-7d51e467379543dd87dd480fbbbf3203-0. INFO 03-01 19:11:55 [logger.py:42] Received request cmpl-5f63d119ed494d63b71a04e645825483-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:55 [async_llm.py:261] Added request cmpl-5f63d119ed494d63b71a04e645825483-0. INFO 03-01 19:11:56 [logger.py:42] Received request cmpl-5f733f2ff05b49de98db9183749a153a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:56 [async_llm.py:261] Added request cmpl-5f733f2ff05b49de98db9183749a153a-0. INFO 03-01 19:11:57 [logger.py:42] Received request cmpl-0c3feacfd4be4f08bb7b7af3210f3f55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:57 [async_llm.py:261] Added request cmpl-0c3feacfd4be4f08bb7b7af3210f3f55-0. INFO 03-01 19:11:59 [logger.py:42] Received request cmpl-7b11b1a18ee041feab1ae071f950c2d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:11:59 [async_llm.py:261] Added request cmpl-7b11b1a18ee041feab1ae071f950c2d2-0. INFO 03-01 19:12:00 [logger.py:42] Received request cmpl-e988f69fdd2f4a5cba997ab95abfa3b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:00 [async_llm.py:261] Added request cmpl-e988f69fdd2f4a5cba997ab95abfa3b2-0. INFO 03-01 19:12:01 [logger.py:42] Received request cmpl-69b367afc0bc4a6a886969314759d2ef-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:01 [async_llm.py:261] Added request cmpl-69b367afc0bc4a6a886969314759d2ef-0. INFO 03-01 19:12:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:12:02 [logger.py:42] Received request cmpl-7dfd0245077d43a297acfab795d981bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:02 [async_llm.py:261] Added request cmpl-7dfd0245077d43a297acfab795d981bc-0. INFO 03-01 19:12:03 [logger.py:42] Received request cmpl-fbb8e0529a994442b81a925a739afe52-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:03 [async_llm.py:261] Added request cmpl-fbb8e0529a994442b81a925a739afe52-0. INFO 03-01 19:12:04 [logger.py:42] Received request cmpl-e01b178b32b04e4d9bee63d0b906e8e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:04 [async_llm.py:261] Added request cmpl-e01b178b32b04e4d9bee63d0b906e8e6-0. INFO 03-01 19:12:06 [logger.py:42] Received request cmpl-681b63ddcfaa471cb1c90bec57317af8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:06 [async_llm.py:261] Added request cmpl-681b63ddcfaa471cb1c90bec57317af8-0. INFO 03-01 19:12:07 [logger.py:42] Received request cmpl-0fa792a4d5194d80806f3a747a28883d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:07 [async_llm.py:261] Added request cmpl-0fa792a4d5194d80806f3a747a28883d-0. INFO 03-01 19:12:08 [logger.py:42] Received request cmpl-5bd243dfa2ab4e2c95ff97b0644e666c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:08 [async_llm.py:261] Added request cmpl-5bd243dfa2ab4e2c95ff97b0644e666c-0. INFO 03-01 19:12:09 [logger.py:42] Received request cmpl-f2ccdc221b8a4048b02979581e75db21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:09 [async_llm.py:261] Added request cmpl-f2ccdc221b8a4048b02979581e75db21-0. INFO 03-01 19:12:10 [logger.py:42] Received request cmpl-deffd5f65773487d92070d68d9f4d1cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:10 [async_llm.py:261] Added request cmpl-deffd5f65773487d92070d68d9f4d1cf-0. INFO 03-01 19:12:11 [logger.py:42] Received request cmpl-ff051b0ea86b48a19969665ff6fd06a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:11 [async_llm.py:261] Added request cmpl-ff051b0ea86b48a19969665ff6fd06a3-0. INFO 03-01 19:12:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:12:12 [logger.py:42] Received request cmpl-70d23ead18494f42819cc73a9b47d793-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:12 [async_llm.py:261] Added request cmpl-70d23ead18494f42819cc73a9b47d793-0. INFO 03-01 19:12:14 [logger.py:42] Received request cmpl-85ae586b722e4e3598a3a0eacdc19c03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:14 [async_llm.py:261] Added request cmpl-85ae586b722e4e3598a3a0eacdc19c03-0. INFO 03-01 19:12:15 [logger.py:42] Received request cmpl-88358532ce144fa2993019bd54ba23ab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:15 [async_llm.py:261] Added request cmpl-88358532ce144fa2993019bd54ba23ab-0. INFO 03-01 19:12:16 [logger.py:42] Received request cmpl-a48191d7b34040dbb43ab9c3c172797b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:16 [async_llm.py:261] Added request cmpl-a48191d7b34040dbb43ab9c3c172797b-0. INFO 03-01 19:12:17 [logger.py:42] Received request cmpl-d3fa3fec16f64d9f924428890ab7222c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:17 [async_llm.py:261] Added request cmpl-d3fa3fec16f64d9f924428890ab7222c-0. INFO 03-01 19:12:18 [logger.py:42] Received request cmpl-56c0fd3f0d4d483f9869f7fbc9733979-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:18 [async_llm.py:261] Added request cmpl-56c0fd3f0d4d483f9869f7fbc9733979-0. INFO 03-01 19:12:19 [logger.py:42] Received request cmpl-75d5981f79144b6989fc2ba9ec7a614d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:19 [async_llm.py:261] Added request cmpl-75d5981f79144b6989fc2ba9ec7a614d-0. INFO 03-01 19:12:21 [logger.py:42] Received request cmpl-5028d1fb57d24ed99690ec317e90f30b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:21 [async_llm.py:261] Added request cmpl-5028d1fb57d24ed99690ec317e90f30b-0. INFO 03-01 19:12:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:12:22 [logger.py:42] Received request cmpl-2a45dfb69c1242d8988048c79fd83415-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:22 [async_llm.py:261] Added request cmpl-2a45dfb69c1242d8988048c79fd83415-0. INFO 03-01 19:12:23 [logger.py:42] Received request cmpl-3c8e5243fecf41d886378735fb6648e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:23 [async_llm.py:261] Added request cmpl-3c8e5243fecf41d886378735fb6648e1-0. INFO 03-01 19:12:24 [logger.py:42] Received request cmpl-3c9d893a9ed0440db9a341deb902af10-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:24 [async_llm.py:261] Added request cmpl-3c9d893a9ed0440db9a341deb902af10-0. INFO 03-01 19:12:25 [logger.py:42] Received request cmpl-b4b5919848d14d5cb8a8a5f21b0df5fd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:25 [async_llm.py:261] Added request cmpl-b4b5919848d14d5cb8a8a5f21b0df5fd-0. INFO 03-01 19:12:26 [logger.py:42] Received request cmpl-99f336f195bb468eaa39c93f8dd1e7b8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:26 [async_llm.py:261] Added request cmpl-99f336f195bb468eaa39c93f8dd1e7b8-0. INFO 03-01 19:12:28 [logger.py:42] Received request cmpl-26839b0c8dd04f68a80f391497dc93cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:28 [async_llm.py:261] Added request cmpl-26839b0c8dd04f68a80f391497dc93cb-0. INFO 03-01 19:12:29 [logger.py:42] Received request cmpl-b7ba937873e143d385020b4af54bc056-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:29 [async_llm.py:261] Added request cmpl-b7ba937873e143d385020b4af54bc056-0. INFO 03-01 19:12:30 [logger.py:42] Received request cmpl-676e0ab518fb432fb29ce8577d37af50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:30 [async_llm.py:261] Added request cmpl-676e0ab518fb432fb29ce8577d37af50-0. INFO 03-01 19:12:31 [logger.py:42] Received request cmpl-1e04b611affb4143b109353c02d0a8de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:31 [async_llm.py:261] Added request cmpl-1e04b611affb4143b109353c02d0a8de-0. INFO 03-01 19:12:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:12:32 [logger.py:42] Received request cmpl-f35dcf88a46b49fa96d956e74724921b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:32 [async_llm.py:261] Added request cmpl-f35dcf88a46b49fa96d956e74724921b-0. INFO 03-01 19:12:33 [logger.py:42] Received request cmpl-459e78685fab4f8cbc5c593cd35a459e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:33 [async_llm.py:261] Added request cmpl-459e78685fab4f8cbc5c593cd35a459e-0. INFO 03-01 19:12:35 [logger.py:42] Received request cmpl-89147ffdf49d41859c7aab04a9958940-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:35 [async_llm.py:261] Added request cmpl-89147ffdf49d41859c7aab04a9958940-0. INFO 03-01 19:12:36 [logger.py:42] Received request cmpl-908e3fb81d264d2d8ea6ed8110728294-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:36 [async_llm.py:261] Added request cmpl-908e3fb81d264d2d8ea6ed8110728294-0. INFO 03-01 19:12:37 [logger.py:42] Received request cmpl-c24e78813be248b6a4e0be739e64efd3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:37 [async_llm.py:261] Added request cmpl-c24e78813be248b6a4e0be739e64efd3-0. INFO 03-01 19:12:38 [logger.py:42] Received request cmpl-9645f018a073447bb75c46049bb8f3bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:38 [async_llm.py:261] Added request cmpl-9645f018a073447bb75c46049bb8f3bd-0. INFO 03-01 19:12:39 [logger.py:42] Received request cmpl-f22012a1d37e4a70b7ad989300bc3901-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:39 [async_llm.py:261] Added request cmpl-f22012a1d37e4a70b7ad989300bc3901-0. INFO 03-01 19:12:40 [logger.py:42] Received request cmpl-9fb264cbad1541b19fe53474e796906a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:40 [async_llm.py:261] Added request cmpl-9fb264cbad1541b19fe53474e796906a-0. INFO 03-01 19:12:41 [logger.py:42] Received request cmpl-a93aa2ea8bc5486cabad51e32ec90d02-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:41 [async_llm.py:261] Added request cmpl-a93aa2ea8bc5486cabad51e32ec90d02-0. INFO 03-01 19:12:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:12:43 [logger.py:42] Received request cmpl-3a513426eb374e9895e4135251f12b70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:43 [async_llm.py:261] Added request cmpl-3a513426eb374e9895e4135251f12b70-0. INFO 03-01 19:12:44 [logger.py:42] Received request cmpl-dd7ed948a65e4015bc661dcf1f7a5e7c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:44 [async_llm.py:261] Added request cmpl-dd7ed948a65e4015bc661dcf1f7a5e7c-0. INFO 03-01 19:12:45 [logger.py:42] Received request cmpl-1c0ca49700ac4d50ae4499906df6cead-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:45 [async_llm.py:261] Added request cmpl-1c0ca49700ac4d50ae4499906df6cead-0. INFO 03-01 19:12:46 [logger.py:42] Received request cmpl-2dc622f8db7b4430bdf310afd7c50f5d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:46 [async_llm.py:261] Added request cmpl-2dc622f8db7b4430bdf310afd7c50f5d-0. INFO 03-01 19:12:47 [logger.py:42] Received request cmpl-7fe0b3589bdb4db69167e5a6e29f6865-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:47 [async_llm.py:261] Added request cmpl-7fe0b3589bdb4db69167e5a6e29f6865-0. INFO 03-01 19:12:48 [logger.py:42] Received request cmpl-48ef5710ba46467f9210007883584e19-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:48 [async_llm.py:261] Added request cmpl-48ef5710ba46467f9210007883584e19-0. INFO 03-01 19:12:50 [logger.py:42] Received request cmpl-2959f05eb69d4976825bd60d07f4f1cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:50 [async_llm.py:261] Added request cmpl-2959f05eb69d4976825bd60d07f4f1cb-0. INFO 03-01 19:12:51 [logger.py:42] Received request cmpl-c3ec306f9c3d4de99644281400d7a407-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:51 [async_llm.py:261] Added request cmpl-c3ec306f9c3d4de99644281400d7a407-0. INFO 03-01 19:12:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:12:52 [logger.py:42] Received request cmpl-86224f39f777421f88000a5c35949a8a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:52 [async_llm.py:261] Added request cmpl-86224f39f777421f88000a5c35949a8a-0. INFO 03-01 19:12:53 [logger.py:42] Received request cmpl-3353e0544dde44e7850f8d6c18972494-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:53 [async_llm.py:261] Added request cmpl-3353e0544dde44e7850f8d6c18972494-0. INFO 03-01 19:12:54 [logger.py:42] Received request cmpl-1421adae051f4830b6b178ccf5c2f86b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:54 [async_llm.py:261] Added request cmpl-1421adae051f4830b6b178ccf5c2f86b-0. INFO 03-01 19:12:55 [logger.py:42] Received request cmpl-a5e49c8dbe1543e4bc078af8aedfc4dd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:55 [async_llm.py:261] Added request cmpl-a5e49c8dbe1543e4bc078af8aedfc4dd-0. INFO 03-01 19:12:57 [logger.py:42] Received request cmpl-572f7cc1327d465a9647313e70f641e4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:57 [async_llm.py:261] Added request cmpl-572f7cc1327d465a9647313e70f641e4-0. INFO 03-01 19:12:58 [logger.py:42] Received request cmpl-b57d3418998e4537bd2d391e20281c72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:58 [async_llm.py:261] Added request cmpl-b57d3418998e4537bd2d391e20281c72-0. INFO 03-01 19:12:59 [logger.py:42] Received request cmpl-70f8423516e94ca094b0897b3e517970-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:12:59 [async_llm.py:261] Added request cmpl-70f8423516e94ca094b0897b3e517970-0. INFO 03-01 19:13:00 [logger.py:42] Received request cmpl-888cfbf118064e35846df5bc34db9fbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:00 [async_llm.py:261] Added request cmpl-888cfbf118064e35846df5bc34db9fbc-0. INFO 03-01 19:13:01 [logger.py:42] Received request cmpl-35670dd529e94837b3623b94c5cc9f9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:01 [async_llm.py:261] Added request cmpl-35670dd529e94837b3623b94c5cc9f9b-0. INFO 03-01 19:13:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:13:02 [logger.py:42] Received request cmpl-542c1cddb5ac46219b78e547659291ea-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:02 [async_llm.py:261] Added request cmpl-542c1cddb5ac46219b78e547659291ea-0. INFO 03-01 19:13:03 [logger.py:42] Received request cmpl-12af677eefab45d89edfae76194f804a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:04 [async_llm.py:261] Added request cmpl-12af677eefab45d89edfae76194f804a-0. INFO 03-01 19:13:05 [logger.py:42] Received request cmpl-66773283057e476ba2a740192d04e9af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:05 [async_llm.py:261] Added request cmpl-66773283057e476ba2a740192d04e9af-0. INFO 03-01 19:13:06 [logger.py:42] Received request cmpl-aea250400609411fa672c904e29daa4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:06 [async_llm.py:261] Added request cmpl-aea250400609411fa672c904e29daa4c-0. INFO 03-01 19:13:07 [logger.py:42] Received request cmpl-c5487e822fc6439cbc57c5c6c3efaf05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:07 [async_llm.py:261] Added request cmpl-c5487e822fc6439cbc57c5c6c3efaf05-0. INFO 03-01 19:13:08 [logger.py:42] Received request cmpl-87629a26954643d99f309144933e0e27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:08 [async_llm.py:261] Added request cmpl-87629a26954643d99f309144933e0e27-0. INFO 03-01 19:13:09 [logger.py:42] Received request cmpl-3ee3b55f46c5499faab871e7c6ed0628-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:09 [async_llm.py:261] Added request cmpl-3ee3b55f46c5499faab871e7c6ed0628-0. INFO 03-01 19:13:10 [logger.py:42] Received request cmpl-bfdc0f78b19a4e7d9b40bcb7005d5cac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:10 [async_llm.py:261] Added request cmpl-bfdc0f78b19a4e7d9b40bcb7005d5cac-0. INFO 03-01 19:13:12 [logger.py:42] Received request cmpl-68145f3054764edb84bba124ed7876db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:12 [async_llm.py:261] Added request cmpl-68145f3054764edb84bba124ed7876db-0. INFO 03-01 19:13:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:13:13 [logger.py:42] Received request cmpl-2d36994648ed4c869a8dea623674d3a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:13 [async_llm.py:261] Added request cmpl-2d36994648ed4c869a8dea623674d3a7-0. INFO 03-01 19:13:14 [logger.py:42] Received request cmpl-28568562b5724f5b83583beadce3a4d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:14 [async_llm.py:261] Added request cmpl-28568562b5724f5b83583beadce3a4d3-0. INFO 03-01 19:13:15 [logger.py:42] Received request cmpl-60fb429790eb40ce94c99f2f7bbdf261-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:15 [async_llm.py:261] Added request cmpl-60fb429790eb40ce94c99f2f7bbdf261-0. INFO 03-01 19:13:16 [logger.py:42] Received request cmpl-239162a119294acab025e50cf501f64a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:16 [async_llm.py:261] Added request cmpl-239162a119294acab025e50cf501f64a-0. INFO 03-01 19:13:17 [logger.py:42] Received request cmpl-b1be4b3dff7142c8884e1a616913dda4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:17 [async_llm.py:261] Added request cmpl-b1be4b3dff7142c8884e1a616913dda4-0. INFO 03-01 19:13:19 [logger.py:42] Received request cmpl-92d6914358864a1080d0a74ff52750ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:19 [async_llm.py:261] Added request cmpl-92d6914358864a1080d0a74ff52750ce-0. INFO 03-01 19:13:20 [logger.py:42] Received request cmpl-502b911cec684350aa4486726f67a111-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:20 [async_llm.py:261] Added request cmpl-502b911cec684350aa4486726f67a111-0. INFO 03-01 19:13:21 [logger.py:42] Received request cmpl-cf00211428bb42ac98e8886435b58c0e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:21 [async_llm.py:261] Added request cmpl-cf00211428bb42ac98e8886435b58c0e-0. INFO 03-01 19:13:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:13:22 [logger.py:42] Received request cmpl-76a83abf877444ef932fe1ee00c711ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:22 [async_llm.py:261] Added request cmpl-76a83abf877444ef932fe1ee00c711ff-0. INFO 03-01 19:13:23 [logger.py:42] Received request cmpl-86164f61b6e543d7a5f818f2830cff32-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:23 [async_llm.py:261] Added request cmpl-86164f61b6e543d7a5f818f2830cff32-0. INFO 03-01 19:13:24 [logger.py:42] Received request cmpl-33b2e830a19c44a691db2d22d611b711-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:24 [async_llm.py:261] Added request cmpl-33b2e830a19c44a691db2d22d611b711-0. INFO 03-01 19:13:26 [logger.py:42] Received request cmpl-1d662b810cdb4a17a99ae6844e8fb5d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:26 [async_llm.py:261] Added request cmpl-1d662b810cdb4a17a99ae6844e8fb5d2-0. INFO 03-01 19:13:27 [logger.py:42] Received request cmpl-6c7fd719ef5149a88ee8996eb0829391-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:27 [async_llm.py:261] Added request cmpl-6c7fd719ef5149a88ee8996eb0829391-0. INFO 03-01 19:13:28 [logger.py:42] Received request cmpl-de98d479c92a4862a87190043956d39a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:28 [async_llm.py:261] Added request cmpl-de98d479c92a4862a87190043956d39a-0. INFO 03-01 19:13:29 [logger.py:42] Received request cmpl-fade8173a79e4f75b0f12ab8124e1c7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:29 [async_llm.py:261] Added request cmpl-fade8173a79e4f75b0f12ab8124e1c7e-0. INFO 03-01 19:13:30 [logger.py:42] Received request cmpl-b8b3386e8b174fb9bdbf00d08d8e17af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:30 [async_llm.py:261] Added request cmpl-b8b3386e8b174fb9bdbf00d08d8e17af-0. INFO 03-01 19:13:31 [logger.py:42] Received request cmpl-bc5028b52388427284a4ad3602fe8b35-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:31 [async_llm.py:261] Added request cmpl-bc5028b52388427284a4ad3602fe8b35-0. INFO 03-01 19:13:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:13:32 [logger.py:42] Received request cmpl-59f8f12cee5840edb3e8d2b163d0bbcf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:32 [async_llm.py:261] Added request cmpl-59f8f12cee5840edb3e8d2b163d0bbcf-0. INFO 03-01 19:13:34 [logger.py:42] Received request cmpl-49aef06bd0bc4eb293d3702852865a86-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:34 [async_llm.py:261] Added request cmpl-49aef06bd0bc4eb293d3702852865a86-0. INFO 03-01 19:13:35 [logger.py:42] Received request cmpl-fd1cebbe0620441d8e75c8ab9031353d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:35 [async_llm.py:261] Added request cmpl-fd1cebbe0620441d8e75c8ab9031353d-0. INFO 03-01 19:13:36 [logger.py:42] Received request cmpl-b9aeb3fb7f9542ddac8a44bda8535b1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:36 [async_llm.py:261] Added request cmpl-b9aeb3fb7f9542ddac8a44bda8535b1c-0. INFO 03-01 19:13:37 [logger.py:42] Received request cmpl-598c430d963e495cab627bc140105380-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:37 [async_llm.py:261] Added request cmpl-598c430d963e495cab627bc140105380-0. INFO 03-01 19:13:38 [logger.py:42] Received request cmpl-43f1cf88852e4fd98c833ea95ee3f955-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:38 [async_llm.py:261] Added request cmpl-43f1cf88852e4fd98c833ea95ee3f955-0. INFO 03-01 19:13:39 [logger.py:42] Received request cmpl-fdc45bff55c94cee971632093a126251-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:39 [async_llm.py:261] Added request cmpl-fdc45bff55c94cee971632093a126251-0. INFO 03-01 19:13:41 [logger.py:42] Received request cmpl-c678c754d37c40a1a56492e9c4d70571-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:41 [async_llm.py:261] Added request cmpl-c678c754d37c40a1a56492e9c4d70571-0. INFO 03-01 19:13:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:13:42 [logger.py:42] Received request cmpl-3b6db13e37dd4a948b7d3ef49d982e1f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:42 [async_llm.py:261] Added request cmpl-3b6db13e37dd4a948b7d3ef49d982e1f-0. INFO 03-01 19:13:43 [logger.py:42] Received request cmpl-1fbf8caf62944296b6c81d4ee4c6ce94-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:43 [async_llm.py:261] Added request cmpl-1fbf8caf62944296b6c81d4ee4c6ce94-0. INFO 03-01 19:13:44 [logger.py:42] Received request cmpl-0973277557c248bf91d0cb73cb6ea712-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:44 [async_llm.py:261] Added request cmpl-0973277557c248bf91d0cb73cb6ea712-0. INFO 03-01 19:13:45 [logger.py:42] Received request cmpl-4d2d115ab4154facbada70e01608053d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:45 [async_llm.py:261] Added request cmpl-4d2d115ab4154facbada70e01608053d-0. INFO 03-01 19:13:46 [logger.py:42] Received request cmpl-1014ec01a2c14452a8276c027b15a5a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:46 [async_llm.py:261] Added request cmpl-1014ec01a2c14452a8276c027b15a5a5-0. INFO 03-01 19:13:48 [logger.py:42] Received request cmpl-a18aa90411f7403caae21a9cdfe1694d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:48 [async_llm.py:261] Added request cmpl-a18aa90411f7403caae21a9cdfe1694d-0. INFO 03-01 19:13:49 [logger.py:42] Received request cmpl-d51da619e35b45d1b635eb97453260b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:49 [async_llm.py:261] Added request cmpl-d51da619e35b45d1b635eb97453260b2-0. INFO 03-01 19:13:50 [logger.py:42] Received request cmpl-41b1eff7dc904b509f8febf4eb21f5ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:50 [async_llm.py:261] Added request cmpl-41b1eff7dc904b509f8febf4eb21f5ac-0. INFO 03-01 19:13:51 [logger.py:42] Received request cmpl-70a2e307c806466385bedcb9379d7b1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:51 [async_llm.py:261] Added request cmpl-70a2e307c806466385bedcb9379d7b1d-0. INFO 03-01 19:13:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:13:52 [logger.py:42] Received request cmpl-715d545be9d34f1ba68f5438050b7be8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:52 [async_llm.py:261] Added request cmpl-715d545be9d34f1ba68f5438050b7be8-0. INFO 03-01 19:13:53 [logger.py:42] Received request cmpl-beb946bcf06f485a9a5d535c6122f2e5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:53 [async_llm.py:261] Added request cmpl-beb946bcf06f485a9a5d535c6122f2e5-0. INFO 03-01 19:13:55 [logger.py:42] Received request cmpl-ea4008ef1c964138a8000479dddcf15f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:55 [async_llm.py:261] Added request cmpl-ea4008ef1c964138a8000479dddcf15f-0. INFO 03-01 19:13:56 [logger.py:42] Received request cmpl-9d0264326f5d4a7b8b745d95230e9d66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:56 [async_llm.py:261] Added request cmpl-9d0264326f5d4a7b8b745d95230e9d66-0. INFO 03-01 19:13:57 [logger.py:42] Received request cmpl-0a4411ab944a4fac804169e8d3ba7208-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:57 [async_llm.py:261] Added request cmpl-0a4411ab944a4fac804169e8d3ba7208-0. INFO 03-01 19:13:58 [logger.py:42] Received request cmpl-b3db9deb7a6b43bf92f6d798361ad32b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:58 [async_llm.py:261] Added request cmpl-b3db9deb7a6b43bf92f6d798361ad32b-0. INFO 03-01 19:13:59 [logger.py:42] Received request cmpl-371684fa00094360b1ec92623e132b6b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:13:59 [async_llm.py:261] Added request cmpl-371684fa00094360b1ec92623e132b6b-0. INFO 03-01 19:14:01 [logger.py:42] Received request cmpl-624cf10c1d584a34adeee05700065dd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:01 [async_llm.py:261] Added request cmpl-624cf10c1d584a34adeee05700065dd1-0. INFO 03-01 19:14:02 [logger.py:42] Received request cmpl-b3e008db03954c33b43c395bdf11f106-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:02 [async_llm.py:261] Added request cmpl-b3e008db03954c33b43c395bdf11f106-0. INFO 03-01 19:14:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6% INFO 03-01 19:14:03 [logger.py:42] Received request cmpl-90af51ae1eb74d3599cbac8041d01bdc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:03 [async_llm.py:261] Added request cmpl-90af51ae1eb74d3599cbac8041d01bdc-0. INFO 03-01 19:14:04 [logger.py:42] Received request cmpl-0ea8ca820e3342bcbe949bf27c4054ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:04 [async_llm.py:261] Added request cmpl-0ea8ca820e3342bcbe949bf27c4054ad-0. INFO 03-01 19:14:05 [logger.py:42] Received request cmpl-ffc0c290f20c43ceb0c9f91c43e93df3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:05 [async_llm.py:261] Added request cmpl-ffc0c290f20c43ceb0c9f91c43e93df3-0. INFO 03-01 19:14:06 [logger.py:42] Received request cmpl-f1f043948a6a44718a1e77f9415c49f6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:06 [async_llm.py:261] Added request cmpl-f1f043948a6a44718a1e77f9415c49f6-0. INFO 03-01 19:14:07 [logger.py:42] Received request cmpl-32e6f48a2c4c4a4ca163442992807006-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:07 [async_llm.py:261] Added request cmpl-32e6f48a2c4c4a4ca163442992807006-0. INFO 03-01 19:14:09 [logger.py:42] Received request cmpl-d0d690bd4d8d40f09d98214fa32f04f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:09 [async_llm.py:261] Added request cmpl-d0d690bd4d8d40f09d98214fa32f04f9-0. INFO 03-01 19:14:10 [logger.py:42] Received request cmpl-17b6d18a22944103a5e3b46cf79ee7ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:10 [async_llm.py:261] Added request cmpl-17b6d18a22944103a5e3b46cf79ee7ca-0. INFO 03-01 19:14:11 [logger.py:42] Received request cmpl-5c7da60d0c1e42de8690dc1e3734f6cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:11 [async_llm.py:261] Added request cmpl-5c7da60d0c1e42de8690dc1e3734f6cc-0. INFO 03-01 19:14:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:14:12 [logger.py:42] Received request cmpl-13c69193eeed48a984f37b6983b7bf77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:12 [async_llm.py:261] Added request cmpl-13c69193eeed48a984f37b6983b7bf77-0. INFO 03-01 19:14:13 [logger.py:42] Received request cmpl-f963c8c91980414a98f503b1edfc8d8b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:13 [async_llm.py:261] Added request cmpl-f963c8c91980414a98f503b1edfc8d8b-0. INFO 03-01 19:14:14 [logger.py:42] Received request cmpl-196d6195e93e4095a1a495690dadcbc6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:14 [async_llm.py:261] Added request cmpl-196d6195e93e4095a1a495690dadcbc6-0. INFO 03-01 19:14:16 [logger.py:42] Received request cmpl-f0dac09f5d1f43728122bd2599721f98-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:16 [async_llm.py:261] Added request cmpl-f0dac09f5d1f43728122bd2599721f98-0. INFO 03-01 19:14:17 [logger.py:42] Received request cmpl-e98584042db947e7a592a7d2a69fae61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:17 [async_llm.py:261] Added request cmpl-e98584042db947e7a592a7d2a69fae61-0. INFO 03-01 19:14:18 [logger.py:42] Received request cmpl-b2db5eb0b0a24b3db24c87d537f746c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:18 [async_llm.py:261] Added request cmpl-b2db5eb0b0a24b3db24c87d537f746c3-0. INFO 03-01 19:14:19 [logger.py:42] Received request cmpl-014d55da20944144b31b0b88cd070934-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:19 [async_llm.py:261] Added request cmpl-014d55da20944144b31b0b88cd070934-0. INFO 03-01 19:14:20 [logger.py:42] Received request cmpl-af4df0a58bb6443f83b20b571141fc1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:20 [async_llm.py:261] Added request cmpl-af4df0a58bb6443f83b20b571141fc1c-0. INFO 03-01 19:14:21 [logger.py:42] Received request cmpl-e6a7e925ba3f4bceb6fc5c7196afdd7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:21 [async_llm.py:261] Added request cmpl-e6a7e925ba3f4bceb6fc5c7196afdd7f-0. INFO 03-01 19:14:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:14:23 [logger.py:42] Received request cmpl-764751c0ba2e46018c488767bf2fbcf0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:23 [async_llm.py:261] Added request cmpl-764751c0ba2e46018c488767bf2fbcf0-0. INFO 03-01 19:14:24 [logger.py:42] Received request cmpl-e0c64f1e0351400e9b979a291fba4ceb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:24 [async_llm.py:261] Added request cmpl-e0c64f1e0351400e9b979a291fba4ceb-0. INFO 03-01 19:14:25 [logger.py:42] Received request cmpl-434951b76536499db1db3e991506b5f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:25 [async_llm.py:261] Added request cmpl-434951b76536499db1db3e991506b5f0-0. INFO 03-01 19:14:26 [logger.py:42] Received request cmpl-741e498f23db4eab84863cfbc0b2fbfd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:26 [async_llm.py:261] Added request cmpl-741e498f23db4eab84863cfbc0b2fbfd-0. INFO 03-01 19:14:27 [logger.py:42] Received request cmpl-affe4b4d9ef447f18af1d864cf8cf2a0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:27 [async_llm.py:261] Added request cmpl-affe4b4d9ef447f18af1d864cf8cf2a0-0. INFO 03-01 19:14:28 [logger.py:42] Received request cmpl-3511124ecd0a460fab71de1af8acabc9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:28 [async_llm.py:261] Added request cmpl-3511124ecd0a460fab71de1af8acabc9-0. INFO 03-01 19:14:29 [logger.py:42] Received request cmpl-49f3b1b6b74347e08f4519015c30d80f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:29 [async_llm.py:261] Added request cmpl-49f3b1b6b74347e08f4519015c30d80f-0. INFO 03-01 19:14:31 [logger.py:42] Received request cmpl-7fbc0ffb6bbc4b2a9c77ff7f5311486d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:31 [async_llm.py:261] Added request cmpl-7fbc0ffb6bbc4b2a9c77ff7f5311486d-0. INFO 03-01 19:14:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:14:32 [logger.py:42] Received request cmpl-025462eb676a46f09e99f0599987fa1c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:32 [async_llm.py:261] Added request cmpl-025462eb676a46f09e99f0599987fa1c-0. INFO 03-01 19:14:33 [logger.py:42] Received request cmpl-111db5ba6c82499f91283b4aa0715f11-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:33 [async_llm.py:261] Added request cmpl-111db5ba6c82499f91283b4aa0715f11-0. INFO 03-01 19:14:34 [logger.py:42] Received request cmpl-a490b53ad7a345d5afb7e1555195981d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:34 [async_llm.py:261] Added request cmpl-a490b53ad7a345d5afb7e1555195981d-0. INFO 03-01 19:14:35 [logger.py:42] Received request cmpl-e0048239b3764a9b8cb76b955e3fd244-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:35 [async_llm.py:261] Added request cmpl-e0048239b3764a9b8cb76b955e3fd244-0. INFO 03-01 19:14:36 [logger.py:42] Received request cmpl-51926f3a3b3e4766bb7b745622896808-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:36 [async_llm.py:261] Added request cmpl-51926f3a3b3e4766bb7b745622896808-0. INFO 03-01 19:14:38 [logger.py:42] Received request cmpl-2048f21d2943418b9d375757aa7d8612-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:38 [async_llm.py:261] Added request cmpl-2048f21d2943418b9d375757aa7d8612-0. INFO 03-01 19:14:39 [logger.py:42] Received request cmpl-3e182670e9ba494fbd3096a3e1ade8d2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:39 [async_llm.py:261] Added request cmpl-3e182670e9ba494fbd3096a3e1ade8d2-0. INFO 03-01 19:14:40 [logger.py:42] Received request cmpl-0f0fd6aa159a448ebb965a09955a31eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:40 [async_llm.py:261] Added request cmpl-0f0fd6aa159a448ebb965a09955a31eb-0. INFO 03-01 19:14:41 [logger.py:42] Received request cmpl-1c7445d710ac464f8676f85588d6faab-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:41 [async_llm.py:261] Added request cmpl-1c7445d710ac464f8676f85588d6faab-0. INFO 03-01 19:14:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:14:42 [logger.py:42] Received request cmpl-ccaf082132484cbeb11cb5735e13701b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:42 [async_llm.py:261] Added request cmpl-ccaf082132484cbeb11cb5735e13701b-0. INFO 03-01 19:14:43 [logger.py:42] Received request cmpl-fcec1aec1f914ca3a7a1d65e618ffb27-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:43 [async_llm.py:261] Added request cmpl-fcec1aec1f914ca3a7a1d65e618ffb27-0. INFO 03-01 19:14:44 [logger.py:42] Received request cmpl-53bcd5657264429fa2dc891932927945-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:44 [async_llm.py:261] Added request cmpl-53bcd5657264429fa2dc891932927945-0. INFO 03-01 19:14:46 [logger.py:42] Received request cmpl-1022946c892d4daf813de85f4a210da3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:46 [async_llm.py:261] Added request cmpl-1022946c892d4daf813de85f4a210da3-0. INFO 03-01 19:14:47 [logger.py:42] Received request cmpl-4d8127088463428ab90ffb1f83d46827-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:47 [async_llm.py:261] Added request cmpl-4d8127088463428ab90ffb1f83d46827-0. INFO 03-01 19:14:48 [logger.py:42] Received request cmpl-a7817febc1294206ad3100ed0c157a3d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:48 [async_llm.py:261] Added request cmpl-a7817febc1294206ad3100ed0c157a3d-0. INFO 03-01 19:14:49 [logger.py:42] Received request cmpl-86de00782d2a4ed08edbb16785c973e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:49 [async_llm.py:261] Added request cmpl-86de00782d2a4ed08edbb16785c973e6-0. INFO 03-01 19:14:50 [logger.py:42] Received request cmpl-d49cc4335878434882e306f32fc0da7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:50 [async_llm.py:261] Added request cmpl-d49cc4335878434882e306f32fc0da7a-0. INFO 03-01 19:14:51 [logger.py:42] Received request cmpl-c108b27c705748c1a54ae7e98a40433e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:51 [async_llm.py:261] Added request cmpl-c108b27c705748c1a54ae7e98a40433e-0. INFO 03-01 19:14:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:14:53 [logger.py:42] Received request cmpl-1a22d4b5c31a49c781e7baf54f9e718a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:53 [async_llm.py:261] Added request cmpl-1a22d4b5c31a49c781e7baf54f9e718a-0. INFO 03-01 19:14:54 [logger.py:42] Received request cmpl-e62e73320a074c6dae019ed44fc05789-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:54 [async_llm.py:261] Added request cmpl-e62e73320a074c6dae019ed44fc05789-0. INFO 03-01 19:14:55 [logger.py:42] Received request cmpl-7fb86c561a6346a3a2725c75e7b00a63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:55 [async_llm.py:261] Added request cmpl-7fb86c561a6346a3a2725c75e7b00a63-0. INFO 03-01 19:14:56 [logger.py:42] Received request cmpl-0887fab5c48d49c38feff193021f1761-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:56 [async_llm.py:261] Added request cmpl-0887fab5c48d49c38feff193021f1761-0. INFO 03-01 19:14:57 [logger.py:42] Received request cmpl-68aaf4fd9cb24d6ab2825f788f5cad6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:57 [async_llm.py:261] Added request cmpl-68aaf4fd9cb24d6ab2825f788f5cad6f-0. INFO 03-01 19:14:58 [logger.py:42] Received request cmpl-c8c662a5b05d4c6eb92d4388dcb76fa6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:14:58 [async_llm.py:261] Added request cmpl-c8c662a5b05d4c6eb92d4388dcb76fa6-0. INFO 03-01 19:15:00 [logger.py:42] Received request cmpl-29226ccfc3494cf6a379b6f244ffa9bb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:00 [async_llm.py:261] Added request cmpl-29226ccfc3494cf6a379b6f244ffa9bb-0. INFO 03-01 19:15:01 [logger.py:42] Received request cmpl-4bde0190356545f09692dffbfdaf3d91-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:01 [async_llm.py:261] Added request cmpl-4bde0190356545f09692dffbfdaf3d91-0. INFO 03-01 19:15:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:15:02 [logger.py:42] Received request cmpl-26c1a6fd9e544bc583f0bb3125b78557-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:02 [async_llm.py:261] Added request cmpl-26c1a6fd9e544bc583f0bb3125b78557-0. INFO 03-01 19:15:03 [logger.py:42] Received request cmpl-259172a8e8e8498d8353600795f7cea3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:03 [async_llm.py:261] Added request cmpl-259172a8e8e8498d8353600795f7cea3-0. INFO 03-01 19:15:04 [logger.py:42] Received request cmpl-9f43570849f74b479fa3471a34517d78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:04 [async_llm.py:261] Added request cmpl-9f43570849f74b479fa3471a34517d78-0. INFO 03-01 19:15:05 [logger.py:42] Received request cmpl-4add9ff1e178490f823c62cd69e5b531-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:05 [async_llm.py:261] Added request cmpl-4add9ff1e178490f823c62cd69e5b531-0. INFO 03-01 19:15:07 [logger.py:42] Received request cmpl-d131e4415dec477380e5086ea0515023-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:07 [async_llm.py:261] Added request cmpl-d131e4415dec477380e5086ea0515023-0. INFO 03-01 19:15:08 [logger.py:42] Received request cmpl-2ec2d95102fc4e84be640367de058624-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:08 [async_llm.py:261] Added request cmpl-2ec2d95102fc4e84be640367de058624-0. INFO 03-01 19:15:09 [logger.py:42] Received request cmpl-cbc573dab26d45a08cecc12c923a48d9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:09 [async_llm.py:261] Added request cmpl-cbc573dab26d45a08cecc12c923a48d9-0. INFO 03-01 19:15:10 [logger.py:42] Received request cmpl-5974168316364760843b17237ced983a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:10 [async_llm.py:261] Added request cmpl-5974168316364760843b17237ced983a-0. INFO 03-01 19:15:11 [logger.py:42] Received request cmpl-19abcb63ba864536bc5c576858a3bdbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:11 [async_llm.py:261] Added request cmpl-19abcb63ba864536bc5c576858a3bdbf-0. INFO 03-01 19:15:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:15:12 [logger.py:42] Received request cmpl-3787cba786d5428bb5f0c2179188cdc8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:12 [async_llm.py:261] Added request cmpl-3787cba786d5428bb5f0c2179188cdc8-0. INFO 03-01 19:15:13 [logger.py:42] Received request cmpl-8b10a29f60514cb8bb1545b43516a451-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:13 [async_llm.py:261] Added request cmpl-8b10a29f60514cb8bb1545b43516a451-0. INFO 03-01 19:15:15 [logger.py:42] Received request cmpl-7d550811639f482bab4e028073fdb2df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:15 [async_llm.py:261] Added request cmpl-7d550811639f482bab4e028073fdb2df-0. INFO 03-01 19:15:16 [logger.py:42] Received request cmpl-35bff45ab4714c9e8af27236086ec5c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:16 [async_llm.py:261] Added request cmpl-35bff45ab4714c9e8af27236086ec5c0-0. INFO 03-01 19:15:17 [logger.py:42] Received request cmpl-a55ba821ab8f44c693996a87af404848-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:17 [async_llm.py:261] Added request cmpl-a55ba821ab8f44c693996a87af404848-0. INFO 03-01 19:15:18 [logger.py:42] Received request cmpl-3d59d6b76fcd434ba404ec57fb58cad7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:18 [async_llm.py:261] Added request cmpl-3d59d6b76fcd434ba404ec57fb58cad7-0. INFO 03-01 19:15:19 [logger.py:42] Received request cmpl-2d81a3fedcf44ce793a640ac8ee13b38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:19 [async_llm.py:261] Added request cmpl-2d81a3fedcf44ce793a640ac8ee13b38-0. INFO 03-01 19:15:20 [logger.py:42] Received request cmpl-db8d89f18c8344efa277741422d57679-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:20 [async_llm.py:261] Added request cmpl-db8d89f18c8344efa277741422d57679-0. INFO 03-01 19:15:22 [logger.py:42] Received request cmpl-5867710f3f444d50a88ce93f01ec4531-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:22 [async_llm.py:261] Added request cmpl-5867710f3f444d50a88ce93f01ec4531-0. INFO 03-01 19:15:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:15:23 [logger.py:42] Received request cmpl-2e15d1518ca64a02a94787edef50fb5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:23 [async_llm.py:261] Added request cmpl-2e15d1518ca64a02a94787edef50fb5c-0. INFO 03-01 19:15:24 [logger.py:42] Received request cmpl-c82fdb7688594b8fa3d92691e40a4361-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:24 [async_llm.py:261] Added request cmpl-c82fdb7688594b8fa3d92691e40a4361-0. INFO 03-01 19:15:25 [logger.py:42] Received request cmpl-8a33a0ff44674a4ebd0587936ae77180-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:25 [async_llm.py:261] Added request cmpl-8a33a0ff44674a4ebd0587936ae77180-0. INFO 03-01 19:15:26 [logger.py:42] Received request cmpl-15d1d3e91f634077ad6c700aaa1a6a68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:26 [async_llm.py:261] Added request cmpl-15d1d3e91f634077ad6c700aaa1a6a68-0. INFO 03-01 19:15:27 [logger.py:42] Received request cmpl-8711c92c19fc49ea9cdbd0d00682b275-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:27 [async_llm.py:261] Added request cmpl-8711c92c19fc49ea9cdbd0d00682b275-0. INFO 03-01 19:15:29 [logger.py:42] Received request cmpl-94d483d887f043d08335aba101e8abf3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:29 [async_llm.py:261] Added request cmpl-94d483d887f043d08335aba101e8abf3-0. INFO 03-01 19:15:30 [logger.py:42] Received request cmpl-dbf4147d80f94deea5960f1f3bbad62c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:30 [async_llm.py:261] Added request cmpl-dbf4147d80f94deea5960f1f3bbad62c-0. INFO 03-01 19:15:31 [logger.py:42] Received request cmpl-6bfb58e2dbef4f15839c279b6234020d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:31 [async_llm.py:261] Added request cmpl-6bfb58e2dbef4f15839c279b6234020d-0. INFO 03-01 19:15:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:15:32 [logger.py:42] Received request cmpl-eb1672887fd1434fa766f1506204a0ec-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:32 [async_llm.py:261] Added request cmpl-eb1672887fd1434fa766f1506204a0ec-0. INFO 03-01 19:15:33 [logger.py:42] Received request cmpl-b68e8b6b3a9b48b2b26f2353a09b62a4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:33 [async_llm.py:261] Added request cmpl-b68e8b6b3a9b48b2b26f2353a09b62a4-0. INFO 03-01 19:15:34 [logger.py:42] Received request cmpl-a890ca98bcd346f4bb70551f95551f6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:34 [async_llm.py:261] Added request cmpl-a890ca98bcd346f4bb70551f95551f6c-0. INFO 03-01 19:15:36 [logger.py:42] Received request cmpl-99c939d97f674d328f6b9139b322a732-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:36 [async_llm.py:261] Added request cmpl-99c939d97f674d328f6b9139b322a732-0. INFO 03-01 19:15:37 [logger.py:42] Received request cmpl-fd703c93827d4502be9835153095fc4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:37 [async_llm.py:261] Added request cmpl-fd703c93827d4502be9835153095fc4e-0. INFO 03-01 19:15:38 [logger.py:42] Received request cmpl-87f049f6f9484be69e1ce3d3ba48f7f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:38 [async_llm.py:261] Added request cmpl-87f049f6f9484be69e1ce3d3ba48f7f1-0. INFO 03-01 19:15:39 [logger.py:42] Received request cmpl-808ac23c648b4e3a90d26791468541c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:39 [async_llm.py:261] Added request cmpl-808ac23c648b4e3a90d26791468541c1-0. INFO 03-01 19:15:40 [logger.py:42] Received request cmpl-ecb8617039524ab5bcdd1f68d31edaa2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:40 [async_llm.py:261] Added request cmpl-ecb8617039524ab5bcdd1f68d31edaa2-0. INFO 03-01 19:15:41 [logger.py:42] Received request cmpl-3011fb9158a54b11b93caaabf5e04635-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:41 [async_llm.py:261] Added request cmpl-3011fb9158a54b11b93caaabf5e04635-0. INFO 03-01 19:15:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:15:43 [logger.py:42] Received request cmpl-14f967faf15d484d97dfd7365602b8b3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:43 [async_llm.py:261] Added request cmpl-14f967faf15d484d97dfd7365602b8b3-0. INFO 03-01 19:15:44 [logger.py:42] Received request cmpl-40b71bc6aad4452e85edff19c0e38536-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:44 [async_llm.py:261] Added request cmpl-40b71bc6aad4452e85edff19c0e38536-0. INFO 03-01 19:15:45 [logger.py:42] Received request cmpl-bd193cdda27f4c2a9dda10aabb2370f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:45 [async_llm.py:261] Added request cmpl-bd193cdda27f4c2a9dda10aabb2370f4-0. INFO 03-01 19:15:46 [logger.py:42] Received request cmpl-1f745f23883a4e2289ae7807283c9875-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:46 [async_llm.py:261] Added request cmpl-1f745f23883a4e2289ae7807283c9875-0. INFO 03-01 19:15:47 [logger.py:42] Received request cmpl-f5b712dbc0f949c986421cd61b154251-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:47 [async_llm.py:261] Added request cmpl-f5b712dbc0f949c986421cd61b154251-0. INFO 03-01 19:15:48 [logger.py:42] Received request cmpl-7fdf9d67986d40e099bc92bccb2e6fc3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:48 [async_llm.py:261] Added request cmpl-7fdf9d67986d40e099bc92bccb2e6fc3-0. INFO 03-01 19:15:49 [logger.py:42] Received request cmpl-b9c5b0bd05744b50ab65f470f33c1340-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:49 [async_llm.py:261] Added request cmpl-b9c5b0bd05744b50ab65f470f33c1340-0. INFO 03-01 19:15:51 [logger.py:42] Received request cmpl-dfe6264bbce24b0ab32a948abb74c0be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:51 [async_llm.py:261] Added request cmpl-dfe6264bbce24b0ab32a948abb74c0be-0. INFO 03-01 19:15:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:15:52 [logger.py:42] Received request cmpl-597627e31d1a4aa2bb9e79c6b21643f3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:52 [async_llm.py:261] Added request cmpl-597627e31d1a4aa2bb9e79c6b21643f3-0. INFO 03-01 19:15:53 [logger.py:42] Received request cmpl-b3c67d8e31d041fda2cd0a47c35c972e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:53 [async_llm.py:261] Added request cmpl-b3c67d8e31d041fda2cd0a47c35c972e-0. INFO 03-01 19:15:54 [logger.py:42] Received request cmpl-587262ce21f44a95bfaedadc185af302-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:54 [async_llm.py:261] Added request cmpl-587262ce21f44a95bfaedadc185af302-0. INFO 03-01 19:15:55 [logger.py:42] Received request cmpl-f10f0ea86d514252b2b8e9249b6d9f66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:55 [async_llm.py:261] Added request cmpl-f10f0ea86d514252b2b8e9249b6d9f66-0. INFO 03-01 19:15:56 [logger.py:42] Received request cmpl-74cf827468c44e5cb16f1b82872b9dde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:56 [async_llm.py:261] Added request cmpl-74cf827468c44e5cb16f1b82872b9dde-0. INFO 03-01 19:15:58 [logger.py:42] Received request cmpl-06642958d330402785c03f204d339a48-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:58 [async_llm.py:261] Added request cmpl-06642958d330402785c03f204d339a48-0. INFO 03-01 19:15:59 [logger.py:42] Received request cmpl-d7efe27b82e244a0aa8fd96012792019-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:15:59 [async_llm.py:261] Added request cmpl-d7efe27b82e244a0aa8fd96012792019-0. INFO 03-01 19:16:00 [logger.py:42] Received request cmpl-fc2908709e5b4186b36b80df4e3f9d0f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:00 [async_llm.py:261] Added request cmpl-fc2908709e5b4186b36b80df4e3f9d0f-0. INFO 03-01 19:16:01 [logger.py:42] Received request cmpl-1d00f936f61c4fdeb5936d4b7a65b5b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:01 [async_llm.py:261] Added request cmpl-1d00f936f61c4fdeb5936d4b7a65b5b9-0. INFO 03-01 19:16:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:16:02 [logger.py:42] Received request cmpl-b0208e912ab4408dbabbfbd82d268038-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:02 [async_llm.py:261] Added request cmpl-b0208e912ab4408dbabbfbd82d268038-0. INFO 03-01 19:16:03 [logger.py:42] Received request cmpl-4046d7a0d41643b9934d2202d895e6bf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:03 [async_llm.py:261] Added request cmpl-4046d7a0d41643b9934d2202d895e6bf-0. INFO 03-01 19:16:05 [logger.py:42] Received request cmpl-05c7642153404a278a07118e63f5a4ad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:05 [async_llm.py:261] Added request cmpl-05c7642153404a278a07118e63f5a4ad-0. INFO 03-01 19:16:06 [logger.py:42] Received request cmpl-abcaea67bb034aac840924ebc75851f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:06 [async_llm.py:261] Added request cmpl-abcaea67bb034aac840924ebc75851f1-0. INFO 03-01 19:16:07 [logger.py:42] Received request cmpl-2048ff368d924706ad099b09d72b8b76-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:07 [async_llm.py:261] Added request cmpl-2048ff368d924706ad099b09d72b8b76-0. INFO 03-01 19:16:08 [logger.py:42] Received request cmpl-632dd29e97fa4cb0bc7139921d0b4b14-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:08 [async_llm.py:261] Added request cmpl-632dd29e97fa4cb0bc7139921d0b4b14-0. INFO 03-01 19:16:09 [logger.py:42] Received request cmpl-f8abe6b351924b879ffdca453d033cad-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:09 [async_llm.py:261] Added request cmpl-f8abe6b351924b879ffdca453d033cad-0. INFO 03-01 19:16:10 [logger.py:42] Received request cmpl-94148743223648c197e9f486d9a8a312-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:10 [async_llm.py:261] Added request cmpl-94148743223648c197e9f486d9a8a312-0. INFO 03-01 19:16:11 [logger.py:42] Received request cmpl-7d18cbdb67a14339b0d15f82a32c77c3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:11 [async_llm.py:261] Added request cmpl-7d18cbdb67a14339b0d15f82a32c77c3-0. INFO 03-01 19:16:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:16:13 [logger.py:42] Received request cmpl-5085f73aea0a4000b9e7e6b5a2f65895-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:13 [async_llm.py:261] Added request cmpl-5085f73aea0a4000b9e7e6b5a2f65895-0. INFO 03-01 19:16:14 [logger.py:42] Received request cmpl-6001cd7253d84368acc62dc2fba2a499-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:14 [async_llm.py:261] Added request cmpl-6001cd7253d84368acc62dc2fba2a499-0. INFO 03-01 19:16:15 [logger.py:42] Received request cmpl-7cdb79b5df714e7ca51a40c5c73c01b6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:15 [async_llm.py:261] Added request cmpl-7cdb79b5df714e7ca51a40c5c73c01b6-0. INFO 03-01 19:16:16 [logger.py:42] Received request cmpl-fffb5c898e5d40d98bfac82b11ef0b3e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:16 [async_llm.py:261] Added request cmpl-fffb5c898e5d40d98bfac82b11ef0b3e-0. INFO 03-01 19:16:17 [logger.py:42] Received request cmpl-e992c34827ef4d569e41f4f509a0b026-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:17 [async_llm.py:261] Added request cmpl-e992c34827ef4d569e41f4f509a0b026-0. INFO 03-01 19:16:18 [logger.py:42] Received request cmpl-bf36008599a84f7d8927cbfa52391471-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:18 [async_llm.py:261] Added request cmpl-bf36008599a84f7d8927cbfa52391471-0. INFO 03-01 19:16:20 [logger.py:42] Received request cmpl-106a3ec95502413e9ed5c6c395bebfc7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:20 [async_llm.py:261] Added request cmpl-106a3ec95502413e9ed5c6c395bebfc7-0. INFO 03-01 19:16:21 [logger.py:42] Received request cmpl-1000f983035a494eadd763750a9b186f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:21 [async_llm.py:261] Added request cmpl-1000f983035a494eadd763750a9b186f-0. INFO 03-01 19:16:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:16:22 [logger.py:42] Received request cmpl-8116a721bded4820b306d3518f2f914e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:22 [async_llm.py:261] Added request cmpl-8116a721bded4820b306d3518f2f914e-0. INFO 03-01 19:16:23 [logger.py:42] Received request cmpl-628bebf577d84ce1ab012330bc910c21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:23 [async_llm.py:261] Added request cmpl-628bebf577d84ce1ab012330bc910c21-0. INFO 03-01 19:16:24 [logger.py:42] Received request cmpl-c9bfdc65d146477ea3531c086ceef815-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:24 [async_llm.py:261] Added request cmpl-c9bfdc65d146477ea3531c086ceef815-0. INFO 03-01 19:16:25 [logger.py:42] Received request cmpl-a5dbf5b2b0484c21a833651d405a85a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:25 [async_llm.py:261] Added request cmpl-a5dbf5b2b0484c21a833651d405a85a9-0. INFO 03-01 19:16:27 [logger.py:42] Received request cmpl-5168c5387030459ba3714a5e3a32f8c4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:27 [async_llm.py:261] Added request cmpl-5168c5387030459ba3714a5e3a32f8c4-0. INFO 03-01 19:16:28 [logger.py:42] Received request cmpl-f95f3fda1f774454bfbe0db457abb221-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:28 [async_llm.py:261] Added request cmpl-f95f3fda1f774454bfbe0db457abb221-0. INFO 03-01 19:16:29 [logger.py:42] Received request cmpl-178722b47976451da7d1ab77a9168862-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:29 [async_llm.py:261] Added request cmpl-178722b47976451da7d1ab77a9168862-0. INFO 03-01 19:16:30 [logger.py:42] Received request cmpl-eebc9774a3d04302a6359a5763f73989-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:30 [async_llm.py:261] Added request cmpl-eebc9774a3d04302a6359a5763f73989-0. INFO 03-01 19:16:31 [logger.py:42] Received request cmpl-fc0597208c66449eb30dd54b40666024-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:31 [async_llm.py:261] Added request cmpl-fc0597208c66449eb30dd54b40666024-0. INFO 03-01 19:16:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:16:32 [logger.py:42] Received request cmpl-6ac2d911697547d0b98015e505fb23c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:32 [async_llm.py:261] Added request cmpl-6ac2d911697547d0b98015e505fb23c5-0. INFO 03-01 19:16:33 [logger.py:42] Received request cmpl-5c7ebfa898cd482b9144a05d6b1a5934-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:33 [async_llm.py:261] Added request cmpl-5c7ebfa898cd482b9144a05d6b1a5934-0. INFO 03-01 19:16:35 [logger.py:42] Received request cmpl-8b4c2dcd2cfe4a4aa0ed21350466784b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:35 [async_llm.py:261] Added request cmpl-8b4c2dcd2cfe4a4aa0ed21350466784b-0. INFO 03-01 19:16:36 [logger.py:42] Received request cmpl-5547caced0f54f87b818ccd2e80b7ba8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:36 [async_llm.py:261] Added request cmpl-5547caced0f54f87b818ccd2e80b7ba8-0. INFO 03-01 19:16:37 [logger.py:42] Received request cmpl-332f55473387434686e1c6f98ee23275-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:37 [async_llm.py:261] Added request cmpl-332f55473387434686e1c6f98ee23275-0. INFO 03-01 19:16:38 [logger.py:42] Received request cmpl-996805f613b2428d9751ee4a851a52c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:38 [async_llm.py:261] Added request cmpl-996805f613b2428d9751ee4a851a52c8-0. INFO 03-01 19:16:39 [logger.py:42] Received request cmpl-167e4c9033d84294a46ca8f9b641eab6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:39 [async_llm.py:261] Added request cmpl-167e4c9033d84294a46ca8f9b641eab6-0. INFO 03-01 19:16:40 [logger.py:42] Received request cmpl-4c03ed2ea9864caba50a8d3d10cd9374-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:40 [async_llm.py:261] Added request cmpl-4c03ed2ea9864caba50a8d3d10cd9374-0. INFO 03-01 19:16:42 [logger.py:42] Received request cmpl-0a3572ad7cf3418c8f3be86511341cbe-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:42 [async_llm.py:261] Added request cmpl-0a3572ad7cf3418c8f3be86511341cbe-0. INFO 03-01 19:16:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:16:43 [logger.py:42] Received request cmpl-b8021cbad4b04691913036d2437b5a84-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:43 [async_llm.py:261] Added request cmpl-b8021cbad4b04691913036d2437b5a84-0. INFO 03-01 19:16:44 [logger.py:42] Received request cmpl-272d993af7564b24aa43dd75a6a29433-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:44 [async_llm.py:261] Added request cmpl-272d993af7564b24aa43dd75a6a29433-0. INFO 03-01 19:16:45 [logger.py:42] Received request cmpl-6ae504ba729d4a58831d20392529a3c5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:45 [async_llm.py:261] Added request cmpl-6ae504ba729d4a58831d20392529a3c5-0. INFO 03-01 19:16:46 [logger.py:42] Received request cmpl-34cd1b8079404fbcb5c92a8223e252a3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:46 [async_llm.py:261] Added request cmpl-34cd1b8079404fbcb5c92a8223e252a3-0. INFO 03-01 19:16:47 [logger.py:42] Received request cmpl-84718adb162b48448b4b44e2c5ef0355-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:47 [async_llm.py:261] Added request cmpl-84718adb162b48448b4b44e2c5ef0355-0. INFO 03-01 19:16:49 [logger.py:42] Received request cmpl-82f34fc46be54d1da6c797b21dcd958e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:49 [async_llm.py:261] Added request cmpl-82f34fc46be54d1da6c797b21dcd958e-0. INFO 03-01 19:16:50 [logger.py:42] Received request cmpl-3272fbccfa264e1f88ef86f72b07c75f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:50 [async_llm.py:261] Added request cmpl-3272fbccfa264e1f88ef86f72b07c75f-0. INFO 03-01 19:16:51 [logger.py:42] Received request cmpl-d442be2e839140488a920d37d4e0578d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:51 [async_llm.py:261] Added request cmpl-d442be2e839140488a920d37d4e0578d-0. INFO 03-01 19:16:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:16:52 [logger.py:42] Received request cmpl-01c9aef2a22540bab01f89e7b73a0e6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:52 [async_llm.py:261] Added request cmpl-01c9aef2a22540bab01f89e7b73a0e6e-0. INFO 03-01 19:16:53 [logger.py:42] Received request cmpl-ae237a00ada04bcd99fc65480229eeae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:53 [async_llm.py:261] Added request cmpl-ae237a00ada04bcd99fc65480229eeae-0. INFO 03-01 19:16:54 [logger.py:42] Received request cmpl-ec3bf50e088548749ba80d8a96cb5304-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:54 [async_llm.py:261] Added request cmpl-ec3bf50e088548749ba80d8a96cb5304-0. INFO 03-01 19:16:56 [logger.py:42] Received request cmpl-2282ceac2eb74769a09b855fc25ff698-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:56 [async_llm.py:261] Added request cmpl-2282ceac2eb74769a09b855fc25ff698-0. INFO 03-01 19:16:57 [logger.py:42] Received request cmpl-471e8c21b1fa4338942fa49fe07f67a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:57 [async_llm.py:261] Added request cmpl-471e8c21b1fa4338942fa49fe07f67a9-0. INFO 03-01 19:16:58 [logger.py:42] Received request cmpl-eedb72c3892a466f80cf272dd0df673c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:58 [async_llm.py:261] Added request cmpl-eedb72c3892a466f80cf272dd0df673c-0. INFO 03-01 19:16:59 [logger.py:42] Received request cmpl-ce6764f391344ac0990ca064335a95b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:16:59 [async_llm.py:261] Added request cmpl-ce6764f391344ac0990ca064335a95b1-0. INFO 03-01 19:17:00 [logger.py:42] Received request cmpl-880999c7cb20445c89d3ecc231cec3a8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:00 [async_llm.py:261] Added request cmpl-880999c7cb20445c89d3ecc231cec3a8-0. INFO 03-01 19:17:02 [logger.py:42] Received request cmpl-c7018390cabe44729b18b316fbe2a186-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:02 [async_llm.py:261] Added request cmpl-c7018390cabe44729b18b316fbe2a186-0. INFO 03-01 19:17:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:17:03 [logger.py:42] Received request cmpl-25f3e939feb94cdcab3bfb39dadfb7c0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:03 [async_llm.py:261] Added request cmpl-25f3e939feb94cdcab3bfb39dadfb7c0-0. INFO 03-01 19:17:04 [logger.py:42] Received request cmpl-72e73fdd312244999990736675e2cbf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:04 [async_llm.py:261] Added request cmpl-72e73fdd312244999990736675e2cbf5-0. INFO 03-01 19:17:05 [logger.py:42] Received request cmpl-04337694e6ec45f08b28dad68cf9b1d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:05 [async_llm.py:261] Added request cmpl-04337694e6ec45f08b28dad68cf9b1d8-0. INFO 03-01 19:17:06 [logger.py:42] Received request cmpl-d6c87a920c5e420db2c3bba5078177d5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:06 [async_llm.py:261] Added request cmpl-d6c87a920c5e420db2c3bba5078177d5-0. INFO 03-01 19:17:07 [logger.py:42] Received request cmpl-136561b73c5640dfb46355682686c7a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:07 [async_llm.py:261] Added request cmpl-136561b73c5640dfb46355682686c7a7-0. INFO 03-01 19:17:09 [logger.py:42] Received request cmpl-8b3d2b83e7fb43ed9cc4b4872d439edb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:09 [async_llm.py:261] Added request cmpl-8b3d2b83e7fb43ed9cc4b4872d439edb-0. INFO 03-01 19:17:10 [logger.py:42] Received request cmpl-98d70b4b04c3409daafaa9b9fcec1235-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:10 [async_llm.py:261] Added request cmpl-98d70b4b04c3409daafaa9b9fcec1235-0. INFO 03-01 19:17:11 [logger.py:42] Received request cmpl-ae46fc57304e4aacb3fb65e0f9935c45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:11 [async_llm.py:261] Added request cmpl-ae46fc57304e4aacb3fb65e0f9935c45-0. INFO 03-01 19:17:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:17:12 [logger.py:42] Received request cmpl-0e40faf3d80a40e6be53ac653a80a6bc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:12 [async_llm.py:261] Added request cmpl-0e40faf3d80a40e6be53ac653a80a6bc-0. INFO 03-01 19:17:13 [logger.py:42] Received request cmpl-9b6f94cfdf804377bd859e7b67c49ff5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:13 [async_llm.py:261] Added request cmpl-9b6f94cfdf804377bd859e7b67c49ff5-0. INFO 03-01 19:17:14 [logger.py:42] Received request cmpl-d569801aae11468388d75634e1b4c570-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:14 [async_llm.py:261] Added request cmpl-d569801aae11468388d75634e1b4c570-0. INFO 03-01 19:17:16 [logger.py:42] Received request cmpl-0323bf8c8ce042cb9e11b626f27c634b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:16 [async_llm.py:261] Added request cmpl-0323bf8c8ce042cb9e11b626f27c634b-0. INFO 03-01 19:17:17 [logger.py:42] Received request cmpl-8fac8edfbeb7489ab7e5ce0454da35ed-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:17 [async_llm.py:261] Added request cmpl-8fac8edfbeb7489ab7e5ce0454da35ed-0. INFO 03-01 19:17:18 [logger.py:42] Received request cmpl-980f1731fe674ed59262c17f4986b3cb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:18 [async_llm.py:261] Added request cmpl-980f1731fe674ed59262c17f4986b3cb-0. INFO 03-01 19:17:19 [logger.py:42] Received request cmpl-7b001151dd1e4b6e94ce436d6115e8f9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:19 [async_llm.py:261] Added request cmpl-7b001151dd1e4b6e94ce436d6115e8f9-0. INFO 03-01 19:17:20 [logger.py:42] Received request cmpl-bd0e4ad0376c4bb58c84062212982ba3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:20 [async_llm.py:261] Added request cmpl-bd0e4ad0376c4bb58c84062212982ba3-0. INFO 03-01 19:17:21 [logger.py:42] Received request cmpl-862862fc341249778b86f5f8a0b0c565-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:21 [async_llm.py:261] Added request cmpl-862862fc341249778b86f5f8a0b0c565-0. INFO 03-01 19:17:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:17:23 [logger.py:42] Received request cmpl-9fd2f7d0e48740e79ed0353f124df40b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:23 [async_llm.py:261] Added request cmpl-9fd2f7d0e48740e79ed0353f124df40b-0. INFO 03-01 19:17:24 [logger.py:42] Received request cmpl-208d2e4394a0412c982cc8e81c59b7fc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:24 [async_llm.py:261] Added request cmpl-208d2e4394a0412c982cc8e81c59b7fc-0. INFO 03-01 19:17:25 [logger.py:42] Received request cmpl-ec9dba2b68424dca8f35d921322db2e8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:25 [async_llm.py:261] Added request cmpl-ec9dba2b68424dca8f35d921322db2e8-0. INFO 03-01 19:17:26 [logger.py:42] Received request cmpl-f567335085b44b2fabfd50855af6c058-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:26 [async_llm.py:261] Added request cmpl-f567335085b44b2fabfd50855af6c058-0. INFO 03-01 19:17:27 [logger.py:42] Received request cmpl-25d30798650046e5808b925b4e9e8e7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:27 [async_llm.py:261] Added request cmpl-25d30798650046e5808b925b4e9e8e7a-0. INFO 03-01 19:17:28 [logger.py:42] Received request cmpl-5868bfafd64b477e9b48c21ea626095a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:28 [async_llm.py:261] Added request cmpl-5868bfafd64b477e9b48c21ea626095a-0. INFO 03-01 19:17:29 [logger.py:42] Received request cmpl-da50c99f505f44a0b33d7a57944ac938-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:29 [async_llm.py:261] Added request cmpl-da50c99f505f44a0b33d7a57944ac938-0. INFO 03-01 19:17:31 [logger.py:42] Received request cmpl-790ad512a82f4d14b00ba0c253af4a58-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:31 [async_llm.py:261] Added request cmpl-790ad512a82f4d14b00ba0c253af4a58-0. INFO 03-01 19:17:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:17:32 [logger.py:42] Received request cmpl-edd81642586b432ea1a1269a7f44bcf5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:32 [async_llm.py:261] Added request cmpl-edd81642586b432ea1a1269a7f44bcf5-0. INFO 03-01 19:17:33 [logger.py:42] Received request cmpl-1c0f4c4e2bd34d91b8082a76e9aea860-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:33 [async_llm.py:261] Added request cmpl-1c0f4c4e2bd34d91b8082a76e9aea860-0. INFO 03-01 19:17:34 [logger.py:42] Received request cmpl-23fafd30445541ac99cb713131d9d07a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:34 [async_llm.py:261] Added request cmpl-23fafd30445541ac99cb713131d9d07a-0. INFO 03-01 19:17:35 [logger.py:42] Received request cmpl-7f87d9a37c46455d8288de9c953958ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:35 [async_llm.py:261] Added request cmpl-7f87d9a37c46455d8288de9c953958ba-0. INFO 03-01 19:17:36 [logger.py:42] Received request cmpl-9f3c06a95b8849398efa994cc909a26d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:36 [async_llm.py:261] Added request cmpl-9f3c06a95b8849398efa994cc909a26d-0. INFO 03-01 19:17:38 [logger.py:42] Received request cmpl-2ed114d674b44c5fb19ad75fd9db3ff0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:38 [async_llm.py:261] Added request cmpl-2ed114d674b44c5fb19ad75fd9db3ff0-0. INFO 03-01 19:17:39 [logger.py:42] Received request cmpl-eb5c7212928d4bc6996abdfb19f608cf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:39 [async_llm.py:261] Added request cmpl-eb5c7212928d4bc6996abdfb19f608cf-0. INFO 03-01 19:17:40 [logger.py:42] Received request cmpl-7c160dddc2ee493e8f7b139690f91dac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:40 [async_llm.py:261] Added request cmpl-7c160dddc2ee493e8f7b139690f91dac-0. INFO 03-01 19:17:41 [logger.py:42] Received request cmpl-dfbc2a944bbb49a2a25e4d53b819ed00-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:41 [async_llm.py:261] Added request cmpl-dfbc2a944bbb49a2a25e4d53b819ed00-0. INFO 03-01 19:17:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:17:42 [logger.py:42] Received request cmpl-832fc967563f4b408574876dbbc8adb3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:42 [async_llm.py:261] Added request cmpl-832fc967563f4b408574876dbbc8adb3-0. INFO 03-01 19:17:43 [logger.py:42] Received request cmpl-41671ac5920e4729b4d113aaedda08d3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:43 [async_llm.py:261] Added request cmpl-41671ac5920e4729b4d113aaedda08d3-0. INFO 03-01 19:17:45 [logger.py:42] Received request cmpl-e666949188cd4fab868ae5d28ca89292-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:45 [async_llm.py:261] Added request cmpl-e666949188cd4fab868ae5d28ca89292-0. INFO 03-01 19:17:46 [logger.py:42] Received request cmpl-5eeeb5c2c86241bb8e56a6c86f88ac7d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:46 [async_llm.py:261] Added request cmpl-5eeeb5c2c86241bb8e56a6c86f88ac7d-0. INFO 03-01 19:17:47 [logger.py:42] Received request cmpl-224029905cad4e5c840405acef8e2806-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:47 [async_llm.py:261] Added request cmpl-224029905cad4e5c840405acef8e2806-0. INFO 03-01 19:17:48 [logger.py:42] Received request cmpl-504a5fbf59354ace843213e3205dc731-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:48 [async_llm.py:261] Added request cmpl-504a5fbf59354ace843213e3205dc731-0. INFO 03-01 19:17:49 [logger.py:42] Received request cmpl-3b42bcc2c69b4fe7beda0e34599113d8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:49 [async_llm.py:261] Added request cmpl-3b42bcc2c69b4fe7beda0e34599113d8-0. INFO 03-01 19:17:50 [logger.py:42] Received request cmpl-913cec9b4d7b4c8d9dcdef9d9e15d46c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:50 [async_llm.py:261] Added request cmpl-913cec9b4d7b4c8d9dcdef9d9e15d46c-0. INFO 03-01 19:17:52 [logger.py:42] Received request cmpl-bd838f8fb6c94395b98aa2416a17cf23-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:52 [async_llm.py:261] Added request cmpl-bd838f8fb6c94395b98aa2416a17cf23-0. INFO 03-01 19:17:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:17:53 [logger.py:42] Received request cmpl-440a6d8b350f46d0a88ab5e5a9294c29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:53 [async_llm.py:261] Added request cmpl-440a6d8b350f46d0a88ab5e5a9294c29-0. INFO 03-01 19:17:54 [logger.py:42] Received request cmpl-6592c6f454354b51a83f10976cf30205-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:54 [async_llm.py:261] Added request cmpl-6592c6f454354b51a83f10976cf30205-0. INFO 03-01 19:17:55 [logger.py:42] Received request cmpl-33d6df6ada334e07b12ccb27940674f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:55 [async_llm.py:261] Added request cmpl-33d6df6ada334e07b12ccb27940674f4-0. INFO 03-01 19:17:56 [logger.py:42] Received request cmpl-3f44dc70688f47639772bfa9a9e37d70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:56 [async_llm.py:261] Added request cmpl-3f44dc70688f47639772bfa9a9e37d70-0. INFO 03-01 19:17:57 [logger.py:42] Received request cmpl-49dab353b1a74d96bfc574f36143a52d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:57 [async_llm.py:261] Added request cmpl-49dab353b1a74d96bfc574f36143a52d-0. INFO 03-01 19:17:58 [logger.py:42] Received request cmpl-0b7edfee6e3f4a078f39d2bafc55af7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:17:58 [async_llm.py:261] Added request cmpl-0b7edfee6e3f4a078f39d2bafc55af7e-0. INFO 03-01 19:18:00 [logger.py:42] Received request cmpl-2c03f7e1631d4989b6631a01162244ff-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:00 [async_llm.py:261] Added request cmpl-2c03f7e1631d4989b6631a01162244ff-0. INFO 03-01 19:18:01 [logger.py:42] Received request cmpl-07a0cfba74404461abb8e7e71cb4d955-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:01 [async_llm.py:261] Added request cmpl-07a0cfba74404461abb8e7e71cb4d955-0. INFO 03-01 19:18:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:18:02 [logger.py:42] Received request cmpl-b977dd59285845c599fefa8482bb25f7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:02 [async_llm.py:261] Added request cmpl-b977dd59285845c599fefa8482bb25f7-0. INFO 03-01 19:18:03 [logger.py:42] Received request cmpl-de253f6d2fdb4ebdac7d7a1ed81e5e74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:03 [async_llm.py:261] Added request cmpl-de253f6d2fdb4ebdac7d7a1ed81e5e74-0. INFO 03-01 19:18:04 [logger.py:42] Received request cmpl-e717391fe9384c4aa227f26d68979790-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:04 [async_llm.py:261] Added request cmpl-e717391fe9384c4aa227f26d68979790-0. INFO 03-01 19:18:05 [logger.py:42] Received request cmpl-a955cc3f6fad4f8e9c3620f892115cbf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:05 [async_llm.py:261] Added request cmpl-a955cc3f6fad4f8e9c3620f892115cbf-0. INFO 03-01 19:18:07 [logger.py:42] Received request cmpl-e931fefb23ae40099f0edf2fc38df6ae-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:07 [async_llm.py:261] Added request cmpl-e931fefb23ae40099f0edf2fc38df6ae-0. INFO 03-01 19:18:08 [logger.py:42] Received request cmpl-57e6e545de3c460588170f11f55c0b06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:08 [async_llm.py:261] Added request cmpl-57e6e545de3c460588170f11f55c0b06-0. INFO 03-01 19:18:09 [logger.py:42] Received request cmpl-2eb68d99960c471a9586f0ff92af49e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:09 [async_llm.py:261] Added request cmpl-2eb68d99960c471a9586f0ff92af49e6-0. INFO 03-01 19:18:10 [logger.py:42] Received request cmpl-e4ea1e1466a9415592b9f179f7bd2870-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:10 [async_llm.py:261] Added request cmpl-e4ea1e1466a9415592b9f179f7bd2870-0. INFO 03-01 19:18:11 [logger.py:42] Received request cmpl-20799b5eedf14e22a9afedd4a6a23598-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:11 [async_llm.py:261] Added request cmpl-20799b5eedf14e22a9afedd4a6a23598-0. INFO 03-01 19:18:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:18:12 [logger.py:42] Received request cmpl-597337aa0eac4da2bfbb7c7d0060035d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:12 [async_llm.py:261] Added request cmpl-597337aa0eac4da2bfbb7c7d0060035d-0. INFO 03-01 19:18:14 [logger.py:42] Received request cmpl-7f50cac27a7442fbab8de187d724cf6c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:14 [async_llm.py:261] Added request cmpl-7f50cac27a7442fbab8de187d724cf6c-0. INFO 03-01 19:18:15 [logger.py:42] Received request cmpl-287aa3795d07424a8f09545ed8c15142-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:15 [async_llm.py:261] Added request cmpl-287aa3795d07424a8f09545ed8c15142-0. INFO 03-01 19:18:16 [logger.py:42] Received request cmpl-4fa8edcc71554fbdb4e1eda80198c4ca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:16 [async_llm.py:261] Added request cmpl-4fa8edcc71554fbdb4e1eda80198c4ca-0. INFO 03-01 19:18:17 [logger.py:42] Received request cmpl-92c4353eb94f4ea6b39638301f2b53f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:17 [async_llm.py:261] Added request cmpl-92c4353eb94f4ea6b39638301f2b53f8-0. INFO 03-01 19:18:18 [logger.py:42] Received request cmpl-792ed2e2a36641d8936c5761508e21f0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:18 [async_llm.py:261] Added request cmpl-792ed2e2a36641d8936c5761508e21f0-0. INFO 03-01 19:18:19 [logger.py:42] Received request cmpl-b81c375feb414d358772c4b4f3e0f55e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:19 [async_llm.py:261] Added request cmpl-b81c375feb414d358772c4b4f3e0f55e-0. INFO 03-01 19:18:20 [logger.py:42] Received request cmpl-72064a877ef84f3094610e7613f0b843-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:20 [async_llm.py:261] Added request cmpl-72064a877ef84f3094610e7613f0b843-0. INFO 03-01 19:18:22 [logger.py:42] Received request cmpl-d021f32f04f34f4eb5ad17825ffe7be7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:22 [async_llm.py:261] Added request cmpl-d021f32f04f34f4eb5ad17825ffe7be7-0. INFO 03-01 19:18:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.4 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:18:23 [logger.py:42] Received request cmpl-c5cb7aca8ff0487c90ab4237a8762ab6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:23 [async_llm.py:261] Added request cmpl-c5cb7aca8ff0487c90ab4237a8762ab6-0. INFO 03-01 19:18:24 [logger.py:42] Received request cmpl-7d028e4439e344de87e77ee3db233c70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:24 [async_llm.py:261] Added request cmpl-7d028e4439e344de87e77ee3db233c70-0. INFO 03-01 19:18:25 [logger.py:42] Received request cmpl-295617631be744ffbc4e0706c1d4ff66-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:25 [async_llm.py:261] Added request cmpl-295617631be744ffbc4e0706c1d4ff66-0. INFO 03-01 19:18:26 [logger.py:42] Received request cmpl-36ddafdf2110474bb57a99be094fc710-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:26 [async_llm.py:261] Added request cmpl-36ddafdf2110474bb57a99be094fc710-0. INFO 03-01 19:18:27 [logger.py:42] Received request cmpl-760e388cfcae475f99fcc0079b99da50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:27 [async_llm.py:261] Added request cmpl-760e388cfcae475f99fcc0079b99da50-0. INFO 03-01 19:18:29 [logger.py:42] Received request cmpl-503575038a17485b96b3e779a98a913a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:29 [async_llm.py:261] Added request cmpl-503575038a17485b96b3e779a98a913a-0. INFO 03-01 19:18:30 [logger.py:42] Received request cmpl-2b8a84614152472fb4f2462d9dd42860-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:30 [async_llm.py:261] Added request cmpl-2b8a84614152472fb4f2462d9dd42860-0. INFO 03-01 19:18:31 [logger.py:42] Received request cmpl-79fd6961392d41c6b884986b5489cfd1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:31 [async_llm.py:261] Added request cmpl-79fd6961392d41c6b884986b5489cfd1-0. INFO 03-01 19:18:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.1 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:18:32 [logger.py:42] Received request cmpl-4605a6e841744b4a8b4d103ceaa174c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:32 [async_llm.py:261] Added request cmpl-4605a6e841744b4a8b4d103ceaa174c1-0. INFO 03-01 19:18:33 [logger.py:42] Received request cmpl-33e0381228294297b2076921fc20874c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:33 [async_llm.py:261] Added request cmpl-33e0381228294297b2076921fc20874c-0. INFO 03-01 19:18:34 [logger.py:42] Received request cmpl-5caf6ee09d1946ee877e78156b2b94b2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:34 [async_llm.py:261] Added request cmpl-5caf6ee09d1946ee877e78156b2b94b2-0. INFO 03-01 19:18:36 [logger.py:42] Received request cmpl-32392d2eb53d4073bfa72eb1fea3a81d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:36 [async_llm.py:261] Added request cmpl-32392d2eb53d4073bfa72eb1fea3a81d-0. INFO 03-01 19:18:37 [logger.py:42] Received request cmpl-4a6a2e7b31b647dcbcfacbf9b305a829-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:37 [async_llm.py:261] Added request cmpl-4a6a2e7b31b647dcbcfacbf9b305a829-0. INFO 03-01 19:18:38 [logger.py:42] Received request cmpl-896ffb5b34f045658adbe4e050dfe8df-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:38 [async_llm.py:261] Added request cmpl-896ffb5b34f045658adbe4e050dfe8df-0. INFO 03-01 19:18:39 [logger.py:42] Received request cmpl-becc9dd1404843dea34b3e7736b98013-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:39 [async_llm.py:261] Added request cmpl-becc9dd1404843dea34b3e7736b98013-0. INFO 03-01 19:18:40 [logger.py:42] Received request cmpl-1ae5e39fe990489481981e2a843e70c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:40 [async_llm.py:261] Added request cmpl-1ae5e39fe990489481981e2a843e70c7-0. INFO 03-01 19:18:41 [logger.py:42] Received request cmpl-1854da07a6cf4fc394b07166c8b00750-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:41 [async_llm.py:261] Added request cmpl-1854da07a6cf4fc394b07166c8b00750-0. INFO 03-01 19:18:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:18:42 [logger.py:42] Received request cmpl-3f2bb73a08fe4dc585efc2b842d89213-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:42 [async_llm.py:261] Added request cmpl-3f2bb73a08fe4dc585efc2b842d89213-0. INFO 03-01 19:18:44 [logger.py:42] Received request cmpl-9534905055a9484fb8da4524ba3947d0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:44 [async_llm.py:261] Added request cmpl-9534905055a9484fb8da4524ba3947d0-0. INFO 03-01 19:18:45 [logger.py:42] Received request cmpl-9d7e654fb3db4a32bd95b03334713c0c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:45 [async_llm.py:261] Added request cmpl-9d7e654fb3db4a32bd95b03334713c0c-0. INFO 03-01 19:18:46 [logger.py:42] Received request cmpl-c7bd33e3372b4e49bf6fd5907c0d7f74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:46 [async_llm.py:261] Added request cmpl-c7bd33e3372b4e49bf6fd5907c0d7f74-0. INFO 03-01 19:18:47 [logger.py:42] Received request cmpl-f4aba0cc71a248f1b74467253dc8903f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:47 [async_llm.py:261] Added request cmpl-f4aba0cc71a248f1b74467253dc8903f-0. INFO 03-01 19:18:48 [logger.py:42] Received request cmpl-04a65359be2a4f1ca07069eb0372125b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:48 [async_llm.py:261] Added request cmpl-04a65359be2a4f1ca07069eb0372125b-0. INFO 03-01 19:18:49 [logger.py:42] Received request cmpl-cff5fb4e657449aaa05268a887681b45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:49 [async_llm.py:261] Added request cmpl-cff5fb4e657449aaa05268a887681b45-0. INFO 03-01 19:18:51 [logger.py:42] Received request cmpl-669d158c848b4dca9e080092fb470a33-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:51 [async_llm.py:261] Added request cmpl-669d158c848b4dca9e080092fb470a33-0. INFO 03-01 19:18:52 [logger.py:42] Received request cmpl-dd479505d3c941ec8026f0d07433481f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:52 [async_llm.py:261] Added request cmpl-dd479505d3c941ec8026f0d07433481f-0. INFO 03-01 19:18:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:18:53 [logger.py:42] Received request cmpl-ad0cd4afe53642369c6f3f1ebfcfca55-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:53 [async_llm.py:261] Added request cmpl-ad0cd4afe53642369c6f3f1ebfcfca55-0. INFO 03-01 19:18:54 [logger.py:42] Received request cmpl-45c640bb3ce14b47a881b65b8ac8d32f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:54 [async_llm.py:261] Added request cmpl-45c640bb3ce14b47a881b65b8ac8d32f-0. INFO 03-01 19:18:55 [logger.py:42] Received request cmpl-69b8a05235784ee08db45ca86ff41bd5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:55 [async_llm.py:261] Added request cmpl-69b8a05235784ee08db45ca86ff41bd5-0. INFO 03-01 19:18:56 [logger.py:42] Received request cmpl-cd84646f91de44ceb3daff1b83059d1a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:56 [async_llm.py:261] Added request cmpl-cd84646f91de44ceb3daff1b83059d1a-0. INFO 03-01 19:18:58 [logger.py:42] Received request cmpl-a84f5eb3096c41f8be7fdccacda43711-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:58 [async_llm.py:261] Added request cmpl-a84f5eb3096c41f8be7fdccacda43711-0. INFO 03-01 19:18:59 [logger.py:42] Received request cmpl-d082e41c397045f6a5484288fc621b28-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:18:59 [async_llm.py:261] Added request cmpl-d082e41c397045f6a5484288fc621b28-0. INFO 03-01 19:19:00 [logger.py:42] Received request cmpl-bd05f2bc67124cff8ea97a1708f86d93-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:00 [async_llm.py:261] Added request cmpl-bd05f2bc67124cff8ea97a1708f86d93-0. INFO 03-01 19:19:01 [logger.py:42] Received request cmpl-0b37f219701b4659b4b3781223e8bd12-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:01 [async_llm.py:261] Added request cmpl-0b37f219701b4659b4b3781223e8bd12-0. INFO 03-01 19:19:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:19:02 [logger.py:42] Received request cmpl-e6123c8f8dbe4137871e7f667501ef9f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:02 [async_llm.py:261] Added request cmpl-e6123c8f8dbe4137871e7f667501ef9f-0. INFO 03-01 19:19:03 [logger.py:42] Received request cmpl-febda5a90f7f40a89ea63794afc8d82f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:03 [async_llm.py:261] Added request cmpl-febda5a90f7f40a89ea63794afc8d82f-0. INFO 03-01 19:19:04 [logger.py:42] Received request cmpl-eee52433bfd643a7a3a2ac1506f1b8c8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:04 [async_llm.py:261] Added request cmpl-eee52433bfd643a7a3a2ac1506f1b8c8-0. INFO 03-01 19:19:06 [logger.py:42] Received request cmpl-ac8334910daa4ac099fa608727852474-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:06 [async_llm.py:261] Added request cmpl-ac8334910daa4ac099fa608727852474-0. INFO 03-01 19:19:07 [logger.py:42] Received request cmpl-f7c56e0cb8c04a029e19a90669bdb4b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:07 [async_llm.py:261] Added request cmpl-f7c56e0cb8c04a029e19a90669bdb4b5-0. INFO 03-01 19:19:08 [logger.py:42] Received request cmpl-58b859377d064a8ebf76982d1fdad833-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:08 [async_llm.py:261] Added request cmpl-58b859377d064a8ebf76982d1fdad833-0. INFO 03-01 19:19:09 [logger.py:42] Received request cmpl-c1cd0b85c63141f2aec06a42747b422a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:09 [async_llm.py:261] Added request cmpl-c1cd0b85c63141f2aec06a42747b422a-0. INFO 03-01 19:19:10 [logger.py:42] Received request cmpl-fe7ebf32b7124d5198121418f67dafcc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:10 [async_llm.py:261] Added request cmpl-fe7ebf32b7124d5198121418f67dafcc-0. INFO 03-01 19:19:11 [logger.py:42] Received request cmpl-063ccec19b5c428295863e0499513ce5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:11 [async_llm.py:261] Added request cmpl-063ccec19b5c428295863e0499513ce5-0. INFO 03-01 19:19:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:19:13 [logger.py:42] Received request cmpl-74078af09c604adfbd4ed283f107aa45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:13 [async_llm.py:261] Added request cmpl-74078af09c604adfbd4ed283f107aa45-0. INFO 03-01 19:19:14 [logger.py:42] Received request cmpl-b2af2b5482d84aacb1c97d6251cf80e7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:14 [async_llm.py:261] Added request cmpl-b2af2b5482d84aacb1c97d6251cf80e7-0. INFO 03-01 19:19:15 [logger.py:42] Received request cmpl-ee40bcf0f5f44d1bb1b5dda8c2019164-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:15 [async_llm.py:261] Added request cmpl-ee40bcf0f5f44d1bb1b5dda8c2019164-0. INFO 03-01 19:19:16 [logger.py:42] Received request cmpl-d9aebef0525849489f65be92e267ab4c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:16 [async_llm.py:261] Added request cmpl-d9aebef0525849489f65be92e267ab4c-0. INFO 03-01 19:19:17 [logger.py:42] Received request cmpl-3db1d4ab4a4c4502ab27a3921d558763-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:17 [async_llm.py:261] Added request cmpl-3db1d4ab4a4c4502ab27a3921d558763-0. INFO 03-01 19:19:18 [logger.py:42] Received request cmpl-64dca453b5b940678524b6431a4c3989-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:18 [async_llm.py:261] Added request cmpl-64dca453b5b940678524b6431a4c3989-0. INFO 03-01 19:19:20 [logger.py:42] Received request cmpl-f85c67b85fb244d0b76363f994d64257-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:20 [async_llm.py:261] Added request cmpl-f85c67b85fb244d0b76363f994d64257-0. INFO 03-01 19:19:21 [logger.py:42] Received request cmpl-c51439dae01a4499a1ba8799c3d462ac-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:21 [async_llm.py:261] Added request cmpl-c51439dae01a4499a1ba8799c3d462ac-0. INFO 03-01 19:19:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:19:22 [logger.py:42] Received request cmpl-9bbfee32da684d04bdb7b772d0cdd7c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:22 [async_llm.py:261] Added request cmpl-9bbfee32da684d04bdb7b772d0cdd7c9-0. INFO 03-01 19:19:23 [logger.py:42] Received request cmpl-0beec200524a4cb39fc73adafd93983d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:23 [async_llm.py:261] Added request cmpl-0beec200524a4cb39fc73adafd93983d-0. INFO 03-01 19:19:24 [logger.py:42] Received request cmpl-fdb4ef922f554502ae7725da0349b141-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:24 [async_llm.py:261] Added request cmpl-fdb4ef922f554502ae7725da0349b141-0. INFO 03-01 19:19:25 [logger.py:42] Received request cmpl-83c47b7b77364b4aa11a0858c0f99279-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:25 [async_llm.py:261] Added request cmpl-83c47b7b77364b4aa11a0858c0f99279-0. INFO 03-01 19:19:26 [logger.py:42] Received request cmpl-6e008fc1bd1d49daba77ca8ce8816ee2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:26 [async_llm.py:261] Added request cmpl-6e008fc1bd1d49daba77ca8ce8816ee2-0. INFO 03-01 19:19:28 [logger.py:42] Received request cmpl-fa94539487b54a94b64d60c5b7d75f8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:28 [async_llm.py:261] Added request cmpl-fa94539487b54a94b64d60c5b7d75f8e-0. INFO 03-01 19:19:29 [logger.py:42] Received request cmpl-7a7a630d886241f6aff668142b1fd884-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:29 [async_llm.py:261] Added request cmpl-7a7a630d886241f6aff668142b1fd884-0. INFO 03-01 19:19:30 [logger.py:42] Received request cmpl-b30ab5db24f949df86b3479250c00837-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:30 [async_llm.py:261] Added request cmpl-b30ab5db24f949df86b3479250c00837-0. INFO 03-01 19:19:31 [logger.py:42] Received request cmpl-19aae5392b3040be8ff1321706a9e9db-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:31 [async_llm.py:261] Added request cmpl-19aae5392b3040be8ff1321706a9e9db-0. INFO 03-01 19:19:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:19:32 [logger.py:42] Received request cmpl-be220d4bd80d4e0e922886b81231c467-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:32 [async_llm.py:261] Added request cmpl-be220d4bd80d4e0e922886b81231c467-0. INFO 03-01 19:19:33 [logger.py:42] Received request cmpl-cbea55db49dc4cda9042efe6fe8ce164-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:33 [async_llm.py:261] Added request cmpl-cbea55db49dc4cda9042efe6fe8ce164-0. INFO 03-01 19:19:35 [logger.py:42] Received request cmpl-b74fd931bdef4f078b84bc5a9f3b5826-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:35 [async_llm.py:261] Added request cmpl-b74fd931bdef4f078b84bc5a9f3b5826-0. INFO 03-01 19:19:36 [logger.py:42] Received request cmpl-1259b14af9834c2fbbc40fba7f0249a9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:36 [async_llm.py:261] Added request cmpl-1259b14af9834c2fbbc40fba7f0249a9-0. INFO 03-01 19:19:37 [logger.py:42] Received request cmpl-fa86c6c7224441f3b212c8be9ccec489-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:37 [async_llm.py:261] Added request cmpl-fa86c6c7224441f3b212c8be9ccec489-0. INFO 03-01 19:19:38 [logger.py:42] Received request cmpl-9bcccad24f1a4a58881dbf76ddb53b61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:38 [async_llm.py:261] Added request cmpl-9bcccad24f1a4a58881dbf76ddb53b61-0. INFO 03-01 19:19:39 [logger.py:42] Received request cmpl-dee4c61e374d4b80b896dfc7e58d713b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:39 [async_llm.py:261] Added request cmpl-dee4c61e374d4b80b896dfc7e58d713b-0. INFO 03-01 19:19:40 [logger.py:42] Received request cmpl-44b54d0b082c4a43be495787df2b20b9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:40 [async_llm.py:261] Added request cmpl-44b54d0b082c4a43be495787df2b20b9-0. INFO 03-01 19:19:42 [logger.py:42] Received request cmpl-6636dc55ee794d65819fbcec3cb2959e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:42 [async_llm.py:261] Added request cmpl-6636dc55ee794d65819fbcec3cb2959e-0. INFO 03-01 19:19:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:19:43 [logger.py:42] Received request cmpl-e459bec7b30242589624a058f5fe2d29-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:43 [async_llm.py:261] Added request cmpl-e459bec7b30242589624a058f5fe2d29-0. INFO 03-01 19:19:44 [logger.py:42] Received request cmpl-f4c8c96552c546f0b6d103762ac59c03-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:44 [async_llm.py:261] Added request cmpl-f4c8c96552c546f0b6d103762ac59c03-0. INFO 03-01 19:19:45 [logger.py:42] Received request cmpl-bff5df9bd78949929b9aa62bbec609f8-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:45 [async_llm.py:261] Added request cmpl-bff5df9bd78949929b9aa62bbec609f8-0. INFO 03-01 19:19:46 [logger.py:42] Received request cmpl-cac3bd5426254fa18b426926a66b637a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:46 [async_llm.py:261] Added request cmpl-cac3bd5426254fa18b426926a66b637a-0. INFO 03-01 19:19:47 [logger.py:42] Received request cmpl-9030c97733954c16be0cca2fb4036cd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:47 [async_llm.py:261] Added request cmpl-9030c97733954c16be0cca2fb4036cd7-0. INFO 03-01 19:19:49 [logger.py:42] Received request cmpl-f62df9f8d6fc46d8a5d6f4340e26ba4f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:49 [async_llm.py:261] Added request cmpl-f62df9f8d6fc46d8a5d6f4340e26ba4f-0. INFO 03-01 19:19:50 [logger.py:42] Received request cmpl-6e2ed32ae58d4170b72bf10892ca8926-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:50 [async_llm.py:261] Added request cmpl-6e2ed32ae58d4170b72bf10892ca8926-0. INFO 03-01 19:19:51 [logger.py:42] Received request cmpl-547f11f9135447c3bbbc6bd5ab49ac50-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:51 [async_llm.py:261] Added request cmpl-547f11f9135447c3bbbc6bd5ab49ac50-0. INFO 03-01 19:19:52 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:19:52 [logger.py:42] Received request cmpl-54fd7756d162414eb3d8bcd6ad2a16eb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:52 [async_llm.py:261] Added request cmpl-54fd7756d162414eb3d8bcd6ad2a16eb-0. INFO 03-01 19:19:53 [logger.py:42] Received request cmpl-31709b6bfd7345ed97c24ca3f945760f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:53 [async_llm.py:261] Added request cmpl-31709b6bfd7345ed97c24ca3f945760f-0. INFO 03-01 19:19:54 [logger.py:42] Received request cmpl-ff8ec310066b4b1ca88bc662c15a8a70-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:54 [async_llm.py:261] Added request cmpl-ff8ec310066b4b1ca88bc662c15a8a70-0. INFO 03-01 19:19:55 [logger.py:42] Received request cmpl-f63e2ba963c74719b1ad63557d0c7d8e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:55 [async_llm.py:261] Added request cmpl-f63e2ba963c74719b1ad63557d0c7d8e-0. INFO 03-01 19:19:57 [logger.py:42] Received request cmpl-0ad1b6b9cf7e40b29ffe074d4db4a987-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:57 [async_llm.py:261] Added request cmpl-0ad1b6b9cf7e40b29ffe074d4db4a987-0. INFO 03-01 19:19:58 [logger.py:42] Received request cmpl-12bc9309ab9d4e7fbdad1aa6917f3523-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:58 [async_llm.py:261] Added request cmpl-12bc9309ab9d4e7fbdad1aa6917f3523-0. INFO 03-01 19:19:59 [logger.py:42] Received request cmpl-3525188331db4cfc800dab9db2e2f5f4-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:19:59 [async_llm.py:261] Added request cmpl-3525188331db4cfc800dab9db2e2f5f4-0. INFO 03-01 19:20:00 [logger.py:42] Received request cmpl-e3fe244ed3f04109ae5507b92d453606-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:00 [async_llm.py:261] Added request cmpl-e3fe244ed3f04109ae5507b92d453606-0. INFO 03-01 19:20:01 [logger.py:42] Received request cmpl-c0d2904f33ac445cb1a75b61497dbab6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:01 [async_llm.py:261] Added request cmpl-c0d2904f33ac445cb1a75b61497dbab6-0. INFO 03-01 19:20:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:20:02 [logger.py:42] Received request cmpl-7a4fc47fcd864b2789873571766fb385-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:02 [async_llm.py:261] Added request cmpl-7a4fc47fcd864b2789873571766fb385-0. INFO 03-01 19:20:04 [logger.py:42] Received request cmpl-bb8a8084bd5a49e692a3e7dc300351b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:04 [async_llm.py:261] Added request cmpl-bb8a8084bd5a49e692a3e7dc300351b1-0. INFO 03-01 19:20:05 [logger.py:42] Received request cmpl-dd20424c92d14f6ca2644dcf3a90abdb-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:05 [async_llm.py:261] Added request cmpl-dd20424c92d14f6ca2644dcf3a90abdb-0. INFO 03-01 19:20:06 [logger.py:42] Received request cmpl-9adf0602da234ccda2b34623ed924943-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:06 [async_llm.py:261] Added request cmpl-9adf0602da234ccda2b34623ed924943-0. INFO 03-01 19:20:07 [logger.py:42] Received request cmpl-4b62ea8265f9488fb6efce19e07f608e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:07 [async_llm.py:261] Added request cmpl-4b62ea8265f9488fb6efce19e07f608e-0. INFO 03-01 19:20:08 [logger.py:42] Received request cmpl-48bd7d5fea224fa5a7086056bb536688-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:08 [async_llm.py:261] Added request cmpl-48bd7d5fea224fa5a7086056bb536688-0. INFO 03-01 19:20:09 [logger.py:42] Received request cmpl-6420d7d3d119432b8fc51c7f6801c457-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:09 [async_llm.py:261] Added request cmpl-6420d7d3d119432b8fc51c7f6801c457-0. INFO 03-01 19:20:11 [logger.py:42] Received request cmpl-d93207b1d35b48e4929c3b9f1aa3084b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:11 [async_llm.py:261] Added request cmpl-d93207b1d35b48e4929c3b9f1aa3084b-0. INFO 03-01 19:20:12 [logger.py:42] Received request cmpl-05408590331248e29f71b5b6a09fddaf-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:12 [async_llm.py:261] Added request cmpl-05408590331248e29f71b5b6a09fddaf-0. INFO 03-01 19:20:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.3 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.4%, Prefix cache hit rate: 51.6% INFO 03-01 19:20:13 [logger.py:42] Received request cmpl-bc749a44eef84fe785f13c5af896f8c2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:13 [async_llm.py:261] Added request cmpl-bc749a44eef84fe785f13c5af896f8c2-0. INFO 03-01 19:20:14 [logger.py:42] Received request cmpl-3793d052173e41848063470e2f9967c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:14 [async_llm.py:261] Added request cmpl-3793d052173e41848063470e2f9967c7-0. INFO 03-01 19:20:15 [logger.py:42] Received request cmpl-a36750837f364ba88b4429bd8c67b1f5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:15 [async_llm.py:261] Added request cmpl-a36750837f364ba88b4429bd8c67b1f5-0. INFO 03-01 19:20:16 [logger.py:42] Received request cmpl-9e1db834325942099bdd4950fd576266-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:16 [async_llm.py:261] Added request cmpl-9e1db834325942099bdd4950fd576266-0. INFO 03-01 19:20:17 [logger.py:42] Received request cmpl-dee5e19c914f425baf1e29ea28d6b008-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:17 [async_llm.py:261] Added request cmpl-dee5e19c914f425baf1e29ea28d6b008-0. INFO 03-01 19:20:19 [logger.py:42] Received request cmpl-838c9cacf3144e88b560f54e08e7c670-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:19 [async_llm.py:261] Added request cmpl-838c9cacf3144e88b560f54e08e7c670-0. INFO 03-01 19:20:20 [logger.py:42] Received request cmpl-59d509a99b504497bec82f2cc4d68d69-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:20 [async_llm.py:261] Added request cmpl-59d509a99b504497bec82f2cc4d68d69-0. INFO 03-01 19:20:21 [logger.py:42] Received request cmpl-da97401efbc948a680cc52f3ea32c2f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:21 [async_llm.py:261] Added request cmpl-da97401efbc948a680cc52f3ea32c2f1-0. INFO 03-01 19:20:22 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:20:22 [logger.py:42] Received request cmpl-94fc60cf8a4149708e2d9a06e7532499-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:22 [async_llm.py:261] Added request cmpl-94fc60cf8a4149708e2d9a06e7532499-0. INFO 03-01 19:20:23 [logger.py:42] Received request cmpl-35fec8d7b2664abdb3f58e1438c3660c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:23 [async_llm.py:261] Added request cmpl-35fec8d7b2664abdb3f58e1438c3660c-0. INFO 03-01 19:20:24 [logger.py:42] Received request cmpl-123175eea565495bb25eacaf51e13f38-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:24 [async_llm.py:261] Added request cmpl-123175eea565495bb25eacaf51e13f38-0. INFO 03-01 19:20:26 [logger.py:42] Received request cmpl-6737d96aaf334f10b9502055af3ed852-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:26 [async_llm.py:261] Added request cmpl-6737d96aaf334f10b9502055af3ed852-0. INFO 03-01 19:20:27 [logger.py:42] Received request cmpl-310cd6a24c274c589c073d5e5e2e0c36-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:27 [async_llm.py:261] Added request cmpl-310cd6a24c274c589c073d5e5e2e0c36-0. INFO 03-01 19:20:28 [logger.py:42] Received request cmpl-530b3d0ae49f495999aa74c2ef8650e6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:28 [async_llm.py:261] Added request cmpl-530b3d0ae49f495999aa74c2ef8650e6-0. INFO 03-01 19:20:29 [logger.py:42] Received request cmpl-f235e372b58a48d3a33ebf73ac7f6fca-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:29 [async_llm.py:261] Added request cmpl-f235e372b58a48d3a33ebf73ac7f6fca-0. INFO 03-01 19:20:30 [logger.py:42] Received request cmpl-5b91f235aed64ca5b37c38fc814c0645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:30 [async_llm.py:261] Added request cmpl-5b91f235aed64ca5b37c38fc814c0645-0. INFO 03-01 19:20:31 [logger.py:42] Received request cmpl-ea192f334d9c4260987b39991823e8e1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:31 [async_llm.py:261] Added request cmpl-ea192f334d9c4260987b39991823e8e1-0. INFO 03-01 19:20:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:20:33 [logger.py:42] Received request cmpl-975ed2ebc3064f1db22b2cd8a8431c07-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:33 [async_llm.py:261] Added request cmpl-975ed2ebc3064f1db22b2cd8a8431c07-0. INFO 03-01 19:20:34 [logger.py:42] Received request cmpl-b393a40fafdf4cfab89d4d01d41d1bcd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:34 [async_llm.py:261] Added request cmpl-b393a40fafdf4cfab89d4d01d41d1bcd-0. INFO 03-01 19:20:35 [logger.py:42] Received request cmpl-361f7c1f41ee4eef8c88edc38ce71ace-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:35 [async_llm.py:261] Added request cmpl-361f7c1f41ee4eef8c88edc38ce71ace-0. INFO 03-01 19:20:36 [logger.py:42] Received request cmpl-9295854c94c643dca1e170576338d253-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:36 [async_llm.py:261] Added request cmpl-9295854c94c643dca1e170576338d253-0. INFO 03-01 19:20:37 [logger.py:42] Received request cmpl-67a1e79beed24298963bac63783ba2b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:37 [async_llm.py:261] Added request cmpl-67a1e79beed24298963bac63783ba2b1-0. INFO 03-01 19:20:38 [logger.py:42] Received request cmpl-33daecd5a61448bdbd95dea9c40fb0f1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:38 [async_llm.py:261] Added request cmpl-33daecd5a61448bdbd95dea9c40fb0f1-0. INFO 03-01 19:20:39 [logger.py:42] Received request cmpl-c6bd9a250db8449f86dbd5f351a47288-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:39 [async_llm.py:261] Added request cmpl-c6bd9a250db8449f86dbd5f351a47288-0. INFO 03-01 19:20:41 [logger.py:42] Received request cmpl-2ebff2865db64870a2faf3096c40b619-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:41 [async_llm.py:261] Added request cmpl-2ebff2865db64870a2faf3096c40b619-0. INFO 03-01 19:20:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:20:42 [logger.py:42] Received request cmpl-0bd60e78ed014d02bcd0a5b265558f62-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:42 [async_llm.py:261] Added request cmpl-0bd60e78ed014d02bcd0a5b265558f62-0. INFO 03-01 19:20:43 [logger.py:42] Received request cmpl-a6d40969cd6d4f14827bdab1e4a9b205-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:43 [async_llm.py:261] Added request cmpl-a6d40969cd6d4f14827bdab1e4a9b205-0. INFO 03-01 19:20:44 [logger.py:42] Received request cmpl-ba64eb11ef6d4d80b8c97c2de5799ed7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:44 [async_llm.py:261] Added request cmpl-ba64eb11ef6d4d80b8c97c2de5799ed7-0. INFO 03-01 19:20:45 [logger.py:42] Received request cmpl-5aa2f53b68774e6c8cd5abae7be30d08-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:45 [async_llm.py:261] Added request cmpl-5aa2f53b68774e6c8cd5abae7be30d08-0. INFO 03-01 19:20:46 [logger.py:42] Received request cmpl-a121e7ead48447d78164c78063152d72-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:46 [async_llm.py:261] Added request cmpl-a121e7ead48447d78164c78063152d72-0. INFO 03-01 19:20:48 [logger.py:42] Received request cmpl-82c682981d054ecaa0dceba2fc82f785-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:48 [async_llm.py:261] Added request cmpl-82c682981d054ecaa0dceba2fc82f785-0. INFO 03-01 19:20:49 [logger.py:42] Received request cmpl-2c5958c38b884909a62608e2e519b591-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:49 [async_llm.py:261] Added request cmpl-2c5958c38b884909a62608e2e519b591-0. INFO 03-01 19:20:50 [logger.py:42] Received request cmpl-976e824e87bc4c208cd89e2aa02e112d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:50 [async_llm.py:261] Added request cmpl-976e824e87bc4c208cd89e2aa02e112d-0. INFO 03-01 19:20:51 [logger.py:42] Received request cmpl-30cc8ef466494481a341f4feb5ea06b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:51 [async_llm.py:261] Added request cmpl-30cc8ef466494481a341f4feb5ea06b0-0. INFO 03-01 19:20:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:20:52 [logger.py:42] Received request cmpl-8d34fbda5ad845e280ff462d288b216a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:52 [async_llm.py:261] Added request cmpl-8d34fbda5ad845e280ff462d288b216a-0. INFO 03-01 19:20:53 [logger.py:42] Received request cmpl-8e757a2cdcd64af689452dc6389070ba-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:53 [async_llm.py:261] Added request cmpl-8e757a2cdcd64af689452dc6389070ba-0. INFO 03-01 19:20:55 [logger.py:42] Received request cmpl-52cb21aa45fa40b09d039bc7b7bd7c7e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:55 [async_llm.py:261] Added request cmpl-52cb21aa45fa40b09d039bc7b7bd7c7e-0. INFO 03-01 19:20:56 [logger.py:42] Received request cmpl-9a33c4e423f64d0bbdf2dab8ccde1c59-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:56 [async_llm.py:261] Added request cmpl-9a33c4e423f64d0bbdf2dab8ccde1c59-0. INFO 03-01 19:20:57 [logger.py:42] Received request cmpl-fc852a0faa1a46ebb669e4e5b32fe0c7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:57 [async_llm.py:261] Added request cmpl-fc852a0faa1a46ebb669e4e5b32fe0c7-0. INFO 03-01 19:20:58 [logger.py:42] Received request cmpl-0ef9a986da964af6b62a0da112887c51-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:58 [async_llm.py:261] Added request cmpl-0ef9a986da964af6b62a0da112887c51-0. INFO 03-01 19:20:59 [logger.py:42] Received request cmpl-b2232fb8b10049a1bb1cf796327bb7c9-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:20:59 [async_llm.py:261] Added request cmpl-b2232fb8b10049a1bb1cf796327bb7c9-0. INFO 03-01 19:21:00 [logger.py:42] Received request cmpl-16b732e2172146c98c33e65e96b156b1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:00 [async_llm.py:261] Added request cmpl-16b732e2172146c98c33e65e96b156b1-0. INFO 03-01 19:21:01 [logger.py:42] Received request cmpl-bbabd75d88394f05b5c6af93cb782866-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:01 [async_llm.py:261] Added request cmpl-bbabd75d88394f05b5c6af93cb782866-0. INFO 03-01 19:21:02 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:21:03 [logger.py:42] Received request cmpl-c86e97e068ae46349baf207159e04b04-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:03 [async_llm.py:261] Added request cmpl-c86e97e068ae46349baf207159e04b04-0. INFO 03-01 19:21:04 [logger.py:42] Received request cmpl-e6615a143c8542a1b249d4de84294645-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:04 [async_llm.py:261] Added request cmpl-e6615a143c8542a1b249d4de84294645-0. INFO 03-01 19:21:05 [logger.py:42] Received request cmpl-f522340aa159411fb6e9f4a0b84071cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:05 [async_llm.py:261] Added request cmpl-f522340aa159411fb6e9f4a0b84071cc-0. INFO 03-01 19:21:06 [logger.py:42] Received request cmpl-9f5f47b9fe094b06a82ccebbd53d4976-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:06 [async_llm.py:261] Added request cmpl-9f5f47b9fe094b06a82ccebbd53d4976-0. INFO 03-01 19:21:07 [logger.py:42] Received request cmpl-ab8d430100e248bb8d2c8ce2db469854-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:07 [async_llm.py:261] Added request cmpl-ab8d430100e248bb8d2c8ce2db469854-0. INFO 03-01 19:21:08 [logger.py:42] Received request cmpl-3a8db81f16b2450d9fca6ef963774f8d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:08 [async_llm.py:261] Added request cmpl-3a8db81f16b2450d9fca6ef963774f8d-0. INFO 03-01 19:21:10 [logger.py:42] Received request cmpl-2b6cf184cf7146cfba89122657bab7a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:10 [async_llm.py:261] Added request cmpl-2b6cf184cf7146cfba89122657bab7a7-0. INFO 03-01 19:21:11 [logger.py:42] Received request cmpl-ad1818dfd6154bb1a6b34b7dcc1d159e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:11 [async_llm.py:261] Added request cmpl-ad1818dfd6154bb1a6b34b7dcc1d159e-0. INFO 03-01 19:21:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:21:12 [logger.py:42] Received request cmpl-7d38c04404864c0bafc7439a97cbe650-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:12 [async_llm.py:261] Added request cmpl-7d38c04404864c0bafc7439a97cbe650-0. INFO 03-01 19:21:13 [logger.py:42] Received request cmpl-ec270b4e76e44a89bce715ac7fbaf8c1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:13 [async_llm.py:261] Added request cmpl-ec270b4e76e44a89bce715ac7fbaf8c1-0. INFO 03-01 19:21:14 [logger.py:42] Received request cmpl-bb54146022df4a1a8c1e0036bc89299f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:14 [async_llm.py:261] Added request cmpl-bb54146022df4a1a8c1e0036bc89299f-0. INFO 03-01 19:21:15 [logger.py:42] Received request cmpl-ce65dbe9d83c4da488cf74f07ad1a7fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:15 [async_llm.py:261] Added request cmpl-ce65dbe9d83c4da488cf74f07ad1a7fa-0. INFO 03-01 19:21:17 [logger.py:42] Received request cmpl-d10bd1f42ce94bffa9963debe951b584-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:17 [async_llm.py:261] Added request cmpl-d10bd1f42ce94bffa9963debe951b584-0. INFO 03-01 19:21:18 [logger.py:42] Received request cmpl-9ca7828134184b0cb0aaf0a9d359a907-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:18 [async_llm.py:261] Added request cmpl-9ca7828134184b0cb0aaf0a9d359a907-0. INFO 03-01 19:21:19 [logger.py:42] Received request cmpl-75bb97cb7c3d4bde9ba3fc4e1f49fb77-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:19 [async_llm.py:261] Added request cmpl-75bb97cb7c3d4bde9ba3fc4e1f49fb77-0. INFO 03-01 19:21:20 [logger.py:42] Received request cmpl-77067313b338462fbb7a4a1506a3ab7a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:20 [async_llm.py:261] Added request cmpl-77067313b338462fbb7a4a1506a3ab7a-0. INFO 03-01 19:21:21 [logger.py:42] Received request cmpl-ee98b0e86f784e5d93424b3429558e9b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:21 [async_llm.py:261] Added request cmpl-ee98b0e86f784e5d93424b3429558e9b-0. INFO 03-01 19:21:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:21:22 [logger.py:42] Received request cmpl-5b54cee834e84b26b8d7f798a55f7c71-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:22 [async_llm.py:261] Added request cmpl-5b54cee834e84b26b8d7f798a55f7c71-0. INFO 03-01 19:21:23 [logger.py:42] Received request cmpl-33dc0c67b13647fb97bb05d18a073b63-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:23 [async_llm.py:261] Added request cmpl-33dc0c67b13647fb97bb05d18a073b63-0. INFO 03-01 19:21:25 [logger.py:42] Received request cmpl-f6541f4f7c1c4c7186e227289b94b7af-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:25 [async_llm.py:261] Added request cmpl-f6541f4f7c1c4c7186e227289b94b7af-0. INFO 03-01 19:21:26 [logger.py:42] Received request cmpl-20a1aab021ee4573a61150d7b4853d78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:26 [async_llm.py:261] Added request cmpl-20a1aab021ee4573a61150d7b4853d78-0. INFO 03-01 19:21:27 [logger.py:42] Received request cmpl-fe03a8c9cdf34df594ad949b6ac86e5c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:27 [async_llm.py:261] Added request cmpl-fe03a8c9cdf34df594ad949b6ac86e5c-0. INFO 03-01 19:21:28 [logger.py:42] Received request cmpl-1635c910b29445b4a6b2637760a58444-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:28 [async_llm.py:261] Added request cmpl-1635c910b29445b4a6b2637760a58444-0. INFO 03-01 19:21:29 [logger.py:42] Received request cmpl-0710b6d6082648d3ba9d77212a0284d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:29 [async_llm.py:261] Added request cmpl-0710b6d6082648d3ba9d77212a0284d6-0. INFO 03-01 19:21:30 [logger.py:42] Received request cmpl-9489a7272c924edda0a7bd71c8e4cc3f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:30 [async_llm.py:261] Added request cmpl-9489a7272c924edda0a7bd71c8e4cc3f-0. INFO 03-01 19:21:32 [logger.py:42] Received request cmpl-9806e1c90bab4fb4aa87416eb5e993e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:32 [async_llm.py:261] Added request cmpl-9806e1c90bab4fb4aa87416eb5e993e3-0. INFO 03-01 19:21:32 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:21:33 [logger.py:42] Received request cmpl-7ae068ba84c7479f889f3bf12628066f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:33 [async_llm.py:261] Added request cmpl-7ae068ba84c7479f889f3bf12628066f-0. INFO 03-01 19:21:34 [logger.py:42] Received request cmpl-f0b0f7804beb44a78fb43eba4f5faac0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:34 [async_llm.py:261] Added request cmpl-f0b0f7804beb44a78fb43eba4f5faac0-0. INFO 03-01 19:21:35 [logger.py:42] Received request cmpl-3624bb2a86a642308e09a398ea62515d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:35 [async_llm.py:261] Added request cmpl-3624bb2a86a642308e09a398ea62515d-0. INFO 03-01 19:21:36 [logger.py:42] Received request cmpl-158a50b439174355b1795792c1d94f2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:36 [async_llm.py:261] Added request cmpl-158a50b439174355b1795792c1d94f2e-0. INFO 03-01 19:21:37 [logger.py:42] Received request cmpl-7a21a9f7ddcf4abb9262e30d95de7b21-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:37 [async_llm.py:261] Added request cmpl-7a21a9f7ddcf4abb9262e30d95de7b21-0. INFO 03-01 19:21:39 [logger.py:42] Received request cmpl-d3827a13ac894ff99b76490ec77aeb4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:39 [async_llm.py:261] Added request cmpl-d3827a13ac894ff99b76490ec77aeb4e-0. INFO 03-01 19:21:40 [logger.py:42] Received request cmpl-1154a3464e164b79acb922533e290b3b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:40 [async_llm.py:261] Added request cmpl-1154a3464e164b79acb922533e290b3b-0. INFO 03-01 19:21:41 [logger.py:42] Received request cmpl-bae8da16b43b420eae9596274fe60e42-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:41 [async_llm.py:261] Added request cmpl-bae8da16b43b420eae9596274fe60e42-0. INFO 03-01 19:21:42 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:21:42 [logger.py:42] Received request cmpl-cc79c5b9b98845258f6596a024a94676-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:42 [async_llm.py:261] Added request cmpl-cc79c5b9b98845258f6596a024a94676-0. INFO 03-01 19:21:43 [logger.py:42] Received request cmpl-1e42d203ec9d4031921c1633f4bb8785-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:43 [async_llm.py:261] Added request cmpl-1e42d203ec9d4031921c1633f4bb8785-0. INFO 03-01 19:21:44 [logger.py:42] Received request cmpl-5c40f5946fdd4ce9a4cc1821bf3c65be-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:44 [async_llm.py:261] Added request cmpl-5c40f5946fdd4ce9a4cc1821bf3c65be-0. INFO 03-01 19:21:45 [logger.py:42] Received request cmpl-2fa4ee2a7d3848c9850682be048a426c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:45 [async_llm.py:261] Added request cmpl-2fa4ee2a7d3848c9850682be048a426c-0. INFO 03-01 19:21:47 [logger.py:42] Received request cmpl-2bf751694d0a429d82c68aa55ff9aebc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:47 [async_llm.py:261] Added request cmpl-2bf751694d0a429d82c68aa55ff9aebc-0. INFO 03-01 19:21:48 [logger.py:42] Received request cmpl-e77a647cd688483e98254036afb58481-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:48 [async_llm.py:261] Added request cmpl-e77a647cd688483e98254036afb58481-0. INFO 03-01 19:21:49 [logger.py:42] Received request cmpl-ab09a9efdc0649c7b360846bb8db5bb7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:49 [async_llm.py:261] Added request cmpl-ab09a9efdc0649c7b360846bb8db5bb7-0. INFO 03-01 19:21:50 [logger.py:42] Received request cmpl-77bbf16ec4474a3694fe99dd93037005-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:50 [async_llm.py:261] Added request cmpl-77bbf16ec4474a3694fe99dd93037005-0. INFO 03-01 19:21:51 [logger.py:42] Received request cmpl-1b8565f423814a68a7ed3a7829cc921d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:51 [async_llm.py:261] Added request cmpl-1b8565f423814a68a7ed3a7829cc921d-0. INFO 03-01 19:21:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:21:52 [logger.py:42] Received request cmpl-9553fe2007d147d5859f16747de4c222-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:52 [async_llm.py:261] Added request cmpl-9553fe2007d147d5859f16747de4c222-0. INFO 03-01 19:21:54 [logger.py:42] Received request cmpl-239e7c10fd27419ba080379938ae0d2a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:54 [async_llm.py:261] Added request cmpl-239e7c10fd27419ba080379938ae0d2a-0. INFO 03-01 19:21:55 [logger.py:42] Received request cmpl-330e4291c56a4f139b4a376d8b9fd67f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:55 [async_llm.py:261] Added request cmpl-330e4291c56a4f139b4a376d8b9fd67f-0. INFO 03-01 19:21:56 [logger.py:42] Received request cmpl-1c49a5e9fc4d48f59dda092ec69fcb6f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:56 [async_llm.py:261] Added request cmpl-1c49a5e9fc4d48f59dda092ec69fcb6f-0. INFO 03-01 19:21:57 [logger.py:42] Received request cmpl-331711a10c124ba2b465c9d0684818a7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:57 [async_llm.py:261] Added request cmpl-331711a10c124ba2b465c9d0684818a7-0. INFO 03-01 19:21:58 [logger.py:42] Received request cmpl-ce4cdb0f4ba64f4586b0f9c1220a9c2e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:58 [async_llm.py:261] Added request cmpl-ce4cdb0f4ba64f4586b0f9c1220a9c2e-0. INFO 03-01 19:21:59 [logger.py:42] Received request cmpl-9676ac7e072a4c6dbdfc6de1bde13641-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=800, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.4:123 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:59 [async_llm.py:261] Added request cmpl-9676ac7e072a4c6dbdfc6de1bde13641-0. INFO 03-01 19:21:59 [logger.py:42] Received request cmpl-89e885d63fd84907a4f95de3ab984398-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:21:59 [async_llm.py:261] Added request cmpl-89e885d63fd84907a4f95de3ab984398-0. INFO 03-01 19:22:01 [logger.py:42] Received request cmpl-b104cdc61a8948f197901190f8a69b89-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:01 [async_llm.py:261] Added request cmpl-b104cdc61a8948f197901190f8a69b89-0. INFO 03-01 19:22:02 [logger.py:42] Received request cmpl-e96e244893884e299b35a6d1f56235fa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:02 [async_llm.py:261] Added request cmpl-e96e244893884e299b35a6d1f56235fa-0. INFO 03-01 19:22:02 [loggers.py:116] Engine 000: Avg prompt throughput: 31.0 tokens/s, Avg generation throughput: 14.0 tokens/s, Running: 2 reqs, Waiting: 0 reqs, GPU KV cache usage: 1.0%, Prefix cache hit rate: 51.6% INFO 03-01 19:22:03 [logger.py:42] Received request cmpl-bbabd7f3512f471cb3461ac744b1f854-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:03 [async_llm.py:261] Added request cmpl-bbabd7f3512f471cb3461ac744b1f854-0. INFO 03-01 19:22:04 [logger.py:42] Received request cmpl-1be4bf588f0b48ec8627b6948d9e06a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:04 [async_llm.py:261] Added request cmpl-1be4bf588f0b48ec8627b6948d9e06a5-0. INFO 03-01 19:22:05 [logger.py:42] Received request cmpl-685f4d2f08744bcf89dd0871675a6c1d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:05 [async_llm.py:261] Added request cmpl-685f4d2f08744bcf89dd0871675a6c1d-0. INFO 03-01 19:22:06 [logger.py:42] Received request cmpl-e772411bd84b448cafc317f99d40976c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:06 [async_llm.py:261] Added request cmpl-e772411bd84b448cafc317f99d40976c-0. INFO 03-01 19:22:08 [logger.py:42] Received request cmpl-f6f32091f7e84af68316cb4b9adf4675-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:08 [async_llm.py:261] Added request cmpl-f6f32091f7e84af68316cb4b9adf4675-0. INFO 03-01 19:22:09 [logger.py:42] Received request cmpl-0c5dbfa70f244cf69da1914af7aa4204-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:09 [async_llm.py:261] Added request cmpl-0c5dbfa70f244cf69da1914af7aa4204-0. INFO 03-01 19:22:10 [logger.py:42] Received request cmpl-cd00efdd48274c9bb870de17989deb1e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:10 [async_llm.py:261] Added request cmpl-cd00efdd48274c9bb870de17989deb1e-0. INFO 03-01 19:22:11 [logger.py:42] Received request cmpl-cf4aa41e97a343ee953be6b142ef7fa5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:11 [async_llm.py:261] Added request cmpl-cf4aa41e97a343ee953be6b142ef7fa5-0. INFO 03-01 19:22:12 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 29.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:22:12 [logger.py:42] Received request cmpl-13b6baae38ae4ed5aaf5d9829e4f1b61-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:12 [async_llm.py:261] Added request cmpl-13b6baae38ae4ed5aaf5d9829e4f1b61-0. INFO 03-01 19:22:13 [logger.py:42] Received request cmpl-2aa5256ad9a74e2083c3d5dede2e2076-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:13 [async_llm.py:261] Added request cmpl-2aa5256ad9a74e2083c3d5dede2e2076-0. INFO 03-01 19:22:15 [logger.py:42] Received request cmpl-5252d157a5d74a4b824fc89d9a65f3a5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:15 [async_llm.py:261] Added request cmpl-5252d157a5d74a4b824fc89d9a65f3a5-0. INFO 03-01 19:22:16 [logger.py:42] Received request cmpl-c44734899671427398df4910cc8bd25f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:16 [async_llm.py:261] Added request cmpl-c44734899671427398df4910cc8bd25f-0. INFO 03-01 19:22:17 [logger.py:42] Received request cmpl-b4b0e9d273b54ee782f939e3800a4383-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:17 [async_llm.py:261] Added request cmpl-b4b0e9d273b54ee782f939e3800a4383-0. INFO 03-01 19:22:18 [logger.py:42] Received request cmpl-ec9379330d3b4e48bdd2daa85b7c3718-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:18 [async_llm.py:261] Added request cmpl-ec9379330d3b4e48bdd2daa85b7c3718-0. INFO 03-01 19:22:19 [logger.py:42] Received request cmpl-b617b56a00904b6891cf930e8e4d3ae1-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:19 [async_llm.py:261] Added request cmpl-b617b56a00904b6891cf930e8e4d3ae1-0. INFO 03-01 19:22:20 [logger.py:42] Received request cmpl-5ea2469eb63d45e080c46dc87a26f30b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:20 [async_llm.py:261] Added request cmpl-5ea2469eb63d45e080c46dc87a26f30b-0. INFO 03-01 19:22:21 [logger.py:42] Received request cmpl-5985e4364e9947d594bcceb97d72a554-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:21 [async_llm.py:261] Added request cmpl-5985e4364e9947d594bcceb97d72a554-0. INFO 03-01 19:22:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:22:23 [logger.py:42] Received request cmpl-63db1fd0b86c4173b65832cf3905b5b0-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:23 [async_llm.py:261] Added request cmpl-63db1fd0b86c4173b65832cf3905b5b0-0. INFO 03-01 19:22:24 [logger.py:42] Received request cmpl-6556d0d672cc4f4bb9c0fd6c21e26b2b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:24 [async_llm.py:261] Added request cmpl-6556d0d672cc4f4bb9c0fd6c21e26b2b-0. INFO 03-01 19:22:25 [logger.py:42] Received request cmpl-55a85862559042e2886c98e5ba6e5bfd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:25 [async_llm.py:261] Added request cmpl-55a85862559042e2886c98e5ba6e5bfd-0. INFO 03-01 19:22:26 [logger.py:42] Received request cmpl-018a4049ac5a430a969115fe95c97fde-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:26 [async_llm.py:261] Added request cmpl-018a4049ac5a430a969115fe95c97fde-0. INFO 03-01 19:22:27 [logger.py:42] Received request cmpl-fc089a349698437eaa0bef524168c1b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:27 [async_llm.py:261] Added request cmpl-fc089a349698437eaa0bef524168c1b5-0. INFO 03-01 19:22:28 [logger.py:42] Received request cmpl-366316517c9b41ba981fc8f853318669-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:28 [async_llm.py:261] Added request cmpl-366316517c9b41ba981fc8f853318669-0. INFO 03-01 19:22:30 [logger.py:42] Received request cmpl-9248db3eeda04ffdb90935032f37f6ce-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:30 [async_llm.py:261] Added request cmpl-9248db3eeda04ffdb90935032f37f6ce-0. INFO 03-01 19:22:31 [logger.py:42] Received request cmpl-2bffa41d71904bf3af257923bece71cc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:31 [async_llm.py:261] Added request cmpl-2bffa41d71904bf3af257923bece71cc-0. INFO 03-01 19:22:32 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:22:32 [logger.py:42] Received request cmpl-83486cf1fbb541a68be7fe87ad08d383-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:32 [async_llm.py:261] Added request cmpl-83486cf1fbb541a68be7fe87ad08d383-0. INFO 03-01 19:22:33 [logger.py:42] Received request cmpl-d5bc951e45da4106aa8f6e9adbda0b78-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:33 [async_llm.py:261] Added request cmpl-d5bc951e45da4106aa8f6e9adbda0b78-0. INFO 03-01 19:22:34 [logger.py:42] Received request cmpl-f5388e35a61a42d88d7e01eeee56b62d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:34 [async_llm.py:261] Added request cmpl-f5388e35a61a42d88d7e01eeee56b62d-0. INFO 03-01 19:22:35 [logger.py:42] Received request cmpl-01a0b7631a634d31ab996777ca089cfa-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:35 [async_llm.py:261] Added request cmpl-01a0b7631a634d31ab996777ca089cfa-0. INFO 03-01 19:22:37 [logger.py:42] Received request cmpl-469ab7ff96944066ad59e34e704d428e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:37 [async_llm.py:261] Added request cmpl-469ab7ff96944066ad59e34e704d428e-0. INFO 03-01 19:22:38 [logger.py:42] Received request cmpl-9edbef45c49242c79a322fc5195b5acc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:38 [async_llm.py:261] Added request cmpl-9edbef45c49242c79a322fc5195b5acc-0. INFO 03-01 19:22:39 [logger.py:42] Received request cmpl-b487ed408d794ee1b8c378397faf924b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:39 [async_llm.py:261] Added request cmpl-b487ed408d794ee1b8c378397faf924b-0. INFO 03-01 19:22:40 [logger.py:42] Received request cmpl-0bb74fd3a65642b282bce4cac7f14c06-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:40 [async_llm.py:261] Added request cmpl-0bb74fd3a65642b282bce4cac7f14c06-0. INFO 03-01 19:22:41 [logger.py:42] Received request cmpl-c248b9753bad40b4969a0f7cda72b935-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:41 [async_llm.py:261] Added request cmpl-c248b9753bad40b4969a0f7cda72b935-0. INFO 03-01 19:22:42 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:22:42 [logger.py:42] Received request cmpl-6d312a5e3eb046b8b3138d857bde9a68-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:42 [async_llm.py:261] Added request cmpl-6d312a5e3eb046b8b3138d857bde9a68-0. INFO 03-01 19:22:43 [logger.py:42] Received request cmpl-66fa66fefb64467399393cb8d321639c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:43 [async_llm.py:261] Added request cmpl-66fa66fefb64467399393cb8d321639c-0. INFO 03-01 19:22:45 [logger.py:42] Received request cmpl-28e7c2c50dfe49ea8de846e4b8d3157c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:45 [async_llm.py:261] Added request cmpl-28e7c2c50dfe49ea8de846e4b8d3157c-0. INFO 03-01 19:22:46 [logger.py:42] Received request cmpl-92b5b5353fc94bffb71c0b819888a263-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:46 [async_llm.py:261] Added request cmpl-92b5b5353fc94bffb71c0b819888a263-0. INFO 03-01 19:22:47 [logger.py:42] Received request cmpl-66d487920cee4e8ebc2688c25b11876e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:47 [async_llm.py:261] Added request cmpl-66d487920cee4e8ebc2688c25b11876e-0. INFO 03-01 19:22:48 [logger.py:42] Received request cmpl-684be465999b43918ca9d19e95a924b7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:48 [async_llm.py:261] Added request cmpl-684be465999b43918ca9d19e95a924b7-0. INFO 03-01 19:22:49 [logger.py:42] Received request cmpl-f36cec2602fc4049980fa243e6b698e3-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:49 [async_llm.py:261] Added request cmpl-f36cec2602fc4049980fa243e6b698e3-0. INFO 03-01 19:22:50 [logger.py:42] Received request cmpl-d9a95b546fe94125b7754af38ed7f2de-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:50 [async_llm.py:261] Added request cmpl-d9a95b546fe94125b7754af38ed7f2de-0. INFO 03-01 19:22:52 [logger.py:42] Received request cmpl-19a278de02ba43a6aca681f6eceeb16c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:52 [async_llm.py:261] Added request cmpl-19a278de02ba43a6aca681f6eceeb16c-0. INFO 03-01 19:22:52 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:22:53 [logger.py:42] Received request cmpl-6f3ecd9b61a345baa2d6cbcde8295e0a-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:53 [async_llm.py:261] Added request cmpl-6f3ecd9b61a345baa2d6cbcde8295e0a-0. INFO 03-01 19:22:54 [logger.py:42] Received request cmpl-7e795a4436f04c45a9a731e2012efe7f-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:54 [async_llm.py:261] Added request cmpl-7e795a4436f04c45a9a731e2012efe7f-0. INFO 03-01 19:22:55 [logger.py:42] Received request cmpl-f93a5e9d1348463197a8e870e91dbbbc-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:55 [async_llm.py:261] Added request cmpl-f93a5e9d1348463197a8e870e91dbbbc-0. INFO 03-01 19:22:56 [logger.py:42] Received request cmpl-378ff71bce784828a32ebc7f03ff4e4e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:56 [async_llm.py:261] Added request cmpl-378ff71bce784828a32ebc7f03ff4e4e-0. INFO 03-01 19:22:57 [logger.py:42] Received request cmpl-72662256950c4672805d20d5894f2157-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:57 [async_llm.py:261] Added request cmpl-72662256950c4672805d20d5894f2157-0. INFO 03-01 19:22:59 [logger.py:42] Received request cmpl-3123cc5d07ea427eb5cbaaf0f2e6ff45-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:22:59 [async_llm.py:261] Added request cmpl-3123cc5d07ea427eb5cbaaf0f2e6ff45-0. INFO 03-01 19:23:00 [logger.py:42] Received request cmpl-4e741683f2524336a1c377d244cdb5ee-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:00 [async_llm.py:261] Added request cmpl-4e741683f2524336a1c377d244cdb5ee-0. INFO 03-01 19:23:01 [logger.py:42] Received request cmpl-ba7e83811e0e4b34ab281104f7e660d6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:01 [async_llm.py:261] Added request cmpl-ba7e83811e0e4b34ab281104f7e660d6-0. INFO 03-01 19:23:02 [loggers.py:116] Engine 000: Avg prompt throughput: 24.8 tokens/s, Avg generation throughput: 4.0 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:23:02 [logger.py:42] Received request cmpl-2f380f5f7ab54c7ea2e519f0a0f44517-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:02 [async_llm.py:261] Added request cmpl-2f380f5f7ab54c7ea2e519f0a0f44517-0. INFO 03-01 19:23:03 [logger.py:42] Received request cmpl-11addf6bfbcf42878dd5ebc9931ed848-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:03 [async_llm.py:261] Added request cmpl-11addf6bfbcf42878dd5ebc9931ed848-0. INFO 03-01 19:23:04 [logger.py:42] Received request cmpl-f0823693d7c54ea4b3ed117bacbc1b7b-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:04 [async_llm.py:261] Added request cmpl-f0823693d7c54ea4b3ed117bacbc1b7b-0. INFO 03-01 19:23:05 [logger.py:42] Received request cmpl-3114d241aa274fd4b69fd4c2aa7aae6e-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:06 [async_llm.py:261] Added request cmpl-3114d241aa274fd4b69fd4c2aa7aae6e-0. INFO 03-01 19:23:07 [logger.py:42] Received request cmpl-2cf213c67dbd42f98624ef6de83ff3b5-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:07 [async_llm.py:261] Added request cmpl-2cf213c67dbd42f98624ef6de83ff3b5-0. INFO 03-01 19:23:08 [logger.py:42] Received request cmpl-2a636a5d230640f287328e47d1c071c6-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:08 [async_llm.py:261] Added request cmpl-2a636a5d230640f287328e47d1c071c6-0. INFO 03-01 19:23:09 [logger.py:42] Received request cmpl-65bf327233d340ca9b2295f6e73cef22-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:09 [async_llm.py:261] Added request cmpl-65bf327233d340ca9b2295f6e73cef22-0. INFO 03-01 19:23:10 [logger.py:42] Received request cmpl-0c89e439ddd84780ae0eb9453c71b891-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:10 [async_llm.py:261] Added request cmpl-0c89e439ddd84780ae0eb9453c71b891-0. INFO 03-01 19:23:11 [logger.py:42] Received request cmpl-663f8ed9bc0a470d93bb4654fb7ce292-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:11 [async_llm.py:261] Added request cmpl-663f8ed9bc0a470d93bb4654fb7ce292-0. INFO 03-01 19:23:12 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.5 tokens/s, Running: 0 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.1%, Prefix cache hit rate: 51.6% INFO 03-01 19:23:12 [logger.py:42] Received request cmpl-6e2609cac10540daba431bba62a75b05-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:12 [async_llm.py:261] Added request cmpl-6e2609cac10540daba431bba62a75b05-0. INFO 03-01 19:23:14 [logger.py:42] Received request cmpl-656ab13b291847ccac41a8be75863625-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:14 [async_llm.py:261] Added request cmpl-656ab13b291847ccac41a8be75863625-0. INFO 03-01 19:23:15 [logger.py:42] Received request cmpl-59bb7b74bcd740bbb9b9e9507b8a1976-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:15 [async_llm.py:261] Added request cmpl-59bb7b74bcd740bbb9b9e9507b8a1976-0. INFO 03-01 19:23:16 [logger.py:42] Received request cmpl-50c8faa4fcd84ff787a7aefc91155ae2-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:16 [async_llm.py:261] Added request cmpl-50c8faa4fcd84ff787a7aefc91155ae2-0. INFO 03-01 19:23:17 [logger.py:42] Received request cmpl-0d55163d5bc84ad1841fa86bd8ff8473-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:17 [async_llm.py:261] Added request cmpl-0d55163d5bc84ad1841fa86bd8ff8473-0. INFO 03-01 19:23:18 [logger.py:42] Received request cmpl-d9d640221d044d848e4e6464d203307c-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:18 [async_llm.py:261] Added request cmpl-d9d640221d044d848e4e6464d203307c-0. INFO 03-01 19:23:19 [logger.py:42] Received request cmpl-2745a5296c6b4269ae7424a95c27db74-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:19 [async_llm.py:261] Added request cmpl-2745a5296c6b4269ae7424a95c27db74-0. INFO 03-01 19:23:21 [logger.py:42] Received request cmpl-b95678532132400786e650df41db3325-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:21 [async_llm.py:261] Added request cmpl-b95678532132400786e650df41db3325-0. INFO 03-01 19:23:22 [logger.py:42] Received request cmpl-8194264d7d734cb4b38771fd9e35d799-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:22 [async_llm.py:261] Added request cmpl-8194264d7d734cb4b38771fd9e35d799-0. INFO 03-01 19:23:22 [loggers.py:116] Engine 000: Avg prompt throughput: 27.9 tokens/s, Avg generation throughput: 4.2 tokens/s, Running: 1 reqs, Waiting: 0 reqs, GPU KV cache usage: 0.3%, Prefix cache hit rate: 51.6% INFO 03-01 19:23:23 [logger.py:42] Received request cmpl-09eed18fb33b45759c5e462a68e0acd7-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:23 [async_llm.py:261] Added request cmpl-09eed18fb33b45759c5e462a68e0acd7-0. INFO 03-01 19:23:24 [logger.py:42] Received request cmpl-b566c5eee8024276adf8c9fcdf8d0230-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:24 [async_llm.py:261] Added request cmpl-b566c5eee8024276adf8c9fcdf8d0230-0. INFO 03-01 19:23:25 [logger.py:42] Received request cmpl-d8f2d46ef5374e26804649a33ff818bd-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:25 [async_llm.py:261] Added request cmpl-d8f2d46ef5374e26804649a33ff818bd-0. INFO 03-01 19:23:26 [logger.py:42] Received request cmpl-8e44e8746979443fbe6ebefb8f28898d-0: prompt: 'what is the integral of x^2 from 0 to 2?\nPlease reason step by step, and put your final answer within \\boxed{}.', params: SamplingParams(n=1, presence_penalty=0.0, frequency_penalty=0.0, repetition_penalty=1.0, temperature=0.0, top_p=1.0, top_k=0, min_p=0.0, seed=None, stop=[], stop_token_ids=[], bad_words=[], include_stop_str_in_output=False, ignore_eos=False, max_tokens=5, min_tokens=0, logprobs=None, prompt_logprobs=None, skip_special_tokens=True, spaces_between_special_tokens=True, truncate_prompt_tokens=None, guided_decoding=None, extra_args=None), prompt_token_ids: [128000, 12840, 374, 279, 26154, 315, 865, 61, 17, 505, 220, 15, 311, 220, 17, 5380, 5618, 2944, 3094, 555, 3094, 11, 323, 2231, 701, 1620, 4320, 2949, 1144, 80175, 47491], prompt_embeds shape: None, lora_request: None, prompt_adapter_request: None. INFO: 1.2.3.5:1235 - "POST /v1/completions HTTP/1.1" 200 OK INFO 03-01 19:23:26 [async_llm.py:261] Added request cmpl-8e44e8746979443fbe6ebefb8f28898d-0. |