ensure initial prompt is long for perf metrics tests #1471

pavel-esir · 2025-01-03T11:26:46Z

Fix test from #1414
Ticket CVS-155098

ilya-lavrenov · 2025-01-04T08:05:00Z

tests/python_tests/test_llm_pipeline.py

@@ -586,7 +586,8 @@ def run_perf_metrics_collection(model_descr, generation_config: Dict, prompt: st
 def test_perf_metrics(model_descr, generation_config, prompt):
    import time
    start_time = time.perf_counter()
-    perf_metrics = run_perf_metrics_collection(read_model(model_descr), generation_config, prompt)
+    # To ensure the prefill stage takes much more time make initial prompt long.
+    perf_metrics = run_perf_metrics_collection(read_model(model_descr), generation_config, prompt * 200)
    total_time = (time.perf_counter() - start_time) * 1000


In general, assumptions about perf metrics such as assert load_time < 1000.0 should not be in tests, because load_time may vary significantly depending on the machine and on network stability.

Fixed it in #1478 (comment).
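To illustrate the point about absolute thresholds, here is a minimal sketch of assertions that do not depend on machine speed. It is not the project's test code: `timed_ms`, `fake_load`, and `check_load_time` are hypothetical names, and `fake_load` is only a stand-in for real model loading. The idea is to check invariants that hold on any machine (the measured interval is positive and cannot exceed the wall-clock time that encloses it) rather than a fixed bound like 1000 ms.

```python
import time


def timed_ms(fn):
    """Run fn() and return (result, elapsed time in milliseconds)."""
    start = time.perf_counter()
    result = fn()
    return result, (time.perf_counter() - start) * 1000


def fake_load():
    # Hypothetical stand-in for model loading; not a real pipeline call.
    time.sleep(0.01)
    return "model"


def check_load_time():
    outer_start = time.perf_counter()
    _, load_time = timed_ms(fake_load)
    total_time = (time.perf_counter() - outer_start) * 1000

    # Machine-independent invariants instead of `load_time < 1000.0`:
    # the inner interval is positive and nested inside the outer one.
    assert load_time > 0
    assert load_time <= total_time
    return load_time, total_time
```

Because the outer timer starts before and stops after the inner one, and `time.perf_counter()` is monotonic, the second assertion holds regardless of how slow the machine or network is.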

ilya-lavrenov · 2025-01-05T16:36:23Z

tests/python_tests/test_llm_pipeline.py

@@ -586,7 +586,8 @@ def run_perf_metrics_collection(model_descr, generation_config: Dict, prompt: st
 def test_perf_metrics(model_descr, generation_config, prompt):
    import time
    start_time = time.perf_counter()
-    perf_metrics = run_perf_metrics_collection(read_model(model_descr), generation_config, prompt)
+    # To ensure the prefill stage takes much more time make initial prompt long.
+    perf_metrics = run_perf_metrics_collection(read_model(model_descr), generation_config, prompt * 200)


I suppose it's clearer to extend the prompt where it is initially created:

(dict(max_new_tokens=20), 'table is made of' * 200),
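A minimal sketch of that suggestion, assuming the test cases live in a module-level list that feeds a pytest parametrization. The names `BASE_PROMPT`, `PROMPT_REPEATS`, and `perf_metrics_cases` are illustrative and not taken from the test file.

```python
# Hypothetical names; only the tuple shape mirrors the suggestion above.
BASE_PROMPT = 'table is made of'
PROMPT_REPEATS = 200  # long prompt so the prefill stage dominates

perf_metrics_cases = [
    (dict(max_new_tokens=20), BASE_PROMPT * PROMPT_REPEATS),
]

# In the test module this list would presumably feed something like:
#   @pytest.mark.parametrize("generation_config,prompt", perf_metrics_cases)
#   def test_perf_metrics(model_descr, generation_config, prompt): ...
# so the test body can pass `prompt` through unchanged.
```

Note that string repetition concatenates without a separator, so the repeated prompt contains fused words such as "oftable" at each boundary. That probably does not matter for a timing test, but it is worth knowing when reading the generated text.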

pavel-esir added this to the 2025.0 milestone Jan 3, 2025

github-actions bot added the "category: LLM" (LLM pipeline: stateful, static) label Jan 3, 2025

ensure initial prompt is long for perf metrics tests

5c6acbb

ilya-lavrenov reviewed Jan 4, 2025

wutthichai46 approved these changes Jan 4, 2025

ilya-lavrenov reviewed Jan 5, 2025
