fix: add spec for hyperparameters in task design and coder #995

RolandMinrui · 2025-06-27T06:50:55Z

📚 Documentation preview 📚: https://RDAgent--995.org.readthedocs.build/en/995/

rdagent/components/coder/data_science/pipeline/prompts.yaml

rdagent/scenarios/data_science/proposal/exp_gen/proposal.py

rdagent/scenarios/data_science/share.yaml

rdagent/scenarios/data_science/dev/prompts.yaml

rdagent/scenarios/data_science/loop.py

rdagent/scenarios/data_science/proposal/exp_gen/__init__.py

rdagent/scenarios/data_science/proposal/exp_gen/base.py

you-n-g · 2025-07-03T09:28:46Z

rdagent/scenarios/data_science/proposal/exp_gen/prompt_refine.yaml

+    {% include "scenarios.data_science.share:guidelines.refine" %}
+
+    # Refinement Specification
+    ## Hypothesis: {{ hypothesis.hypothesis }}


I think we can discuss this.

In the end, we decided this should be implemented in the runner.

you-n-g · 2025-07-03T13:03:42Z

rdagent/scenarios/data_science/proposal/exp_gen/refine.py

+    def gen(self, trace: DSTrace) -> DSExperiment:
+        # Step 0: Prepare
+        pipeline = DS_RD_SETTING.coder_on_whole_pipeline
+        component_desc = T("scenarios.data_science.share:component_description_in_pipeline").r()


Use include to simplify this.
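A minimal sketch of what this suggestion could look like, assuming the prompts are Jinja2 templates as the `{% include ... %}` line in prompt_refine.yaml suggests. The fragment name `share:component_description`, the `SHARED_PROMPTS` dict, and both helper functions are hypothetical stand-ins for illustration; RD-Agent's actual template loader is not shown in this PR page.

```python
from jinja2 import Environment, FunctionLoader

# Hypothetical shared prompt fragments, keyed by a "file:key" style name.
SHARED_PROMPTS = {
    "share:component_description": "Components: {{ components | join(', ') }}",
}


def load_fragment(name):
    # FunctionLoader treats a None return as "template not found".
    return SHARED_PROMPTS.get(name)


env = Environment(loader=FunctionLoader(load_fragment))


def render_manual(components):
    # Before: render the shared fragment in Python, then pass it in as a variable.
    desc = env.get_template("share:component_description").render(components=components)
    prompt = env.from_string("{{ component_desc }}\nTask: refine the pipeline.")
    return prompt.render(component_desc=desc)


def render_with_include(components):
    # After: the prompt pulls the fragment in itself, so the caller only
    # supplies the variables the fragment needs.
    prompt = env.from_string(
        '{% include "share:component_description" %}\nTask: refine the pipeline.'
    )
    return prompt.render(components=components)


print(render_with_include(["DataLoadSpec", "Model"]))
```

Both helpers produce the same text; the include variant drops the extra Python-side render call, which seems to be the simplification being asked for.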

you-n-g · 2025-07-04T03:42:46Z

rdagent/scenarios/data_science/share.yaml

    You might receive exploratory data analysis (EDA) details about the source data. Do not use this EDA information to create assertions or raise errors. We might generate sample data for quick coding (so your code may run on sample data which is part of the full-size data), but remember that the EDA details are based on the full-size data.
+  draft: |-
+    TODO
+  refine: |-


Low-hanging fruit.

We don't need the refine part for now.

you-n-g · 2025-07-04T03:55:41Z

rdagent/scenarios/data_science/proposal/exp_gen/__init__.py

+                return DSRefineExpGen(scen=self.scen).gen(trace=trace)
+
+        # Propose
+        if DS_RD_SETTING.proposal_version == "v1":


I hope we can remove proposal_version in this version.

you-n-g · 2025-07-04T04:15:41Z

rdagent/scenarios/data_science/proposal/exp_gen/prompt_refine.yaml

+    {% include "scenarios.data_science.share:guidelines.refine" %}
+
+    # Refinement Specification
+    ## Hypothesis: {{ hypothesis.hypothesis }}


In the end, we decided this should be implemented in the runner.

you-n-g · 2025-07-04T04:16:01Z

rdagent/scenarios/data_science/proposal/exp_gen/refine.py

@@ -0,0 +1,105 @@
+import json


We don't need this now.

you-n-g · 2025-07-04T04:16:23Z

rdagent/scenarios/data_science/share.yaml

    You might receive exploratory data analysis (EDA) details about the source data. Do not use this EDA information to create assertions or raise errors. We might generate sample data for quick coding (so your code may run on sample data which is part of the full-size data), but remember that the EDA details are based on the full-size data.
+  draft: |-
+    TODO
+  refine: |-


We don't need the refine part for now.

…#995)

* init commit
* remove the 5-fold spec from prompts
* refine the hyperparameter specification
* do not sample data
* a small spelling issue
* refine prompt to avoid submission cheating
* do not sample data
* simplify code
* refine the coder evaluator prompt
* refine wording
* remove runtime from proposal
* refine wording
* refine prompt
* add gpu info in runtime_info.py
* modify the spec
* add router and add refinement exp gen
* fix prompt bug
* use rule-based logic for router
* complete the prompt
* fix circular import bug
* fix bug
* make refine_decision optional
* update pipeline prompts: (1) add scenary: in an iterative cooding loop and use sample datasets (2)add some generation tops in coding (3)add evaluation guidelines in evaluation (4)polish the json schema and description
* fix a small bug
* fix a small bug
* rdagent/scenarios/data_science/loop.py back to the original version
* refactor: replace _get_exp_gen with default_exp_gen for exp generation
* import
* refactor: make the __init__ back to main
* fix small bugs
* fix bugs for proposal_version
* move refine into runner
* check early stop
* EDA improvement & coder classes number
* fix CI
* slightly refine the prompt
* remove rule_base_eval and remove useless prompt

---------

Co-authored-by: Xu <[email protected]>
Co-authored-by: TPLin22 <[email protected]>
Co-authored-by: amstrongzyf <[email protected]>
Co-authored-by: Xu Yang <[email protected]>
Co-authored-by: Xu Yang <[email protected]>
Co-authored-by: Young <[email protected]>

init commit

da36f43

RolandMinrui marked this pull request as draft June 27, 2025 06:51

remove the 5-fold spec from prompts

979640a

RolandMinrui changed the title ~~init commit~~ fix: add spec for hyperparameters in task design and coder Jun 27, 2025

Xu added 2 commits June 27, 2025 07:45

refine the hyperparameter specification

2c87022

do not sample data

ccdb471

RolandMinrui requested review from TPLin22, peteryang1 and you-n-g June 27, 2025 07:52

TPLin22 and others added 3 commits June 27, 2025 08:08

a small spelling issue

84bf563

refine prompt to avoid submission cheating

13be390

do not sample data

4ca0411

RolandMinrui removed the request for review from TPLin22 June 27, 2025 08:33

you-n-g reviewed Jun 27, 2025

rdagent/components/coder/data_science/pipeline/prompts.yaml (outdated)

you-n-g reviewed Jun 27, 2025

Xu and others added 7 commits June 27, 2025 09:45

simplify code

c122816

refine the coder evaluator prompt

ffec796

refine wording

ffe70ca

remove runtime from proposal

b1f03f2

refine wording

771e7e8

refine prompt

55d8d03

add gpu info in runtime_info.py

3619c95

you-n-g force-pushed the minrui/fix_hyperparameter_problems branch 2 times, most recently from 2c7fa6e to 3619c95 June 29, 2025 09:11

Xu added 5 commits June 30, 2025 03:32

Merge branch 'main' of https://github.com/microsoft/RD-Agent into minrui/fix_hyperparameter_problems

3f487fe

modify the spec

6ec2080

add router and add refinement exp gen

7d27e09

fix prompt bug

b669365

Merge branch 'main' of https://github.com/microsoft/RD-Agent into minrui/fix_hyperparameter_problems

bbb8bcf

Hoder-zyf reviewed Jul 2, 2025

rdagent/scenarios/data_science/dev/prompts.yaml

Xu and others added 3 commits July 3, 2025 05:07

fix bug

81d284a

make refine_decision optional

a18e454

update pipeline prompts: (1) add scenary: in an iterative cooding loop and use sample datasets (2)add some generation tops in coding (3)add evaluation guidelines in evaluation (4)polish the json schema and description

408e7ab

you-n-g reviewed Jul 3, 2025

peteryang1 added 2 commits July 3, 2025 15:34

fix a small bug

beb3bf8

fix a small bug

93a3acd

you-n-g reviewed Jul 4, 2025

peteryangms and others added 10 commits July 4, 2025 06:58

Merge branch 'main' into minrui/fix_hyperparameter_problems

3a15f5c

rdagent/scenarios/data_science/loop.py back to the original version

6d9607a

refactor: replace _get_exp_gen with default_exp_gen for exp generation

8312380

import

ed984eb

refactor: make the __init__ back to main

ceb6335

fix small bugs

833be8f

fix bugs for proposal_version

2e6d190

move refine into runner

71e68c6

Merge branch 'xuyang1/help_minrui_hyppp' into minrui/fix_hyperparameter_problems

2b8a2ed

check early stop

e56ebfd

Hoder-zyf added a commit that referenced this pull request Jul 4, 2025

change the pipeline prompt to the #995 PR version

3191f50

peteryangms and others added 4 commits July 5, 2025 18:21

Merge branch 'main' into minrui/fix_hyperparameter_problems

7caad02

EDA improvement & coder classes number

eb9ec5d

fix CI

2ebcc35

slightly refine the prompt

65deb7d

peteryang1 marked this pull request as ready for review July 8, 2025 07:11

remove rule_base_eval and remove useless prompt

1edf3a9

peteryang1 merged commit 10246fd into main Jul 8, 2025
9 checks passed

peteryang1 deleted the minrui/fix_hyperparameter_problems branch July 8, 2025 07:22

you-n-g mentioned this pull request Jul 8, 2025

chore(main): release 0.8.0 #1030

Merged

peteryang1 restored the minrui/fix_hyperparameter_problems branch July 8, 2025 10:07

7 participants
