replicationbench

v1.0

ReplicationBench - A benchmark for evaluating AI agents on reproducing computational results from astrophysics research papers. Adapted from Christine8888/replicationbench-release.

harbor run -d replicationbench@1.0

Tasks (90)

abacus__ewald_force_accuracy
harbor run -d replicationbench@1.0 -t abacus__ewald_force_accuracy
f0afc88
abacus__ewald_force_comparison
harbor run -d replicationbench@1.0 -t abacus__ewald_force_comparison
f0afc88
abacus__lattice_force_error
harbor run -d replicationbench@1.0 -t abacus__lattice_force_error
f0afc88
abacus__lcdm_total_force_accuracy
harbor run -d replicationbench@1.0 -t abacus__lcdm_total_force_accuracy
f0afc88
astm3__cross_modal_photometry_to_spectra_search
harbor run -d replicationbench@1.0 -t astm3__cross_modal_photometry_to_spectra_search
f0afc88
astm3__modality_importance_rot_class_accuracy
harbor run -d replicationbench@1.0 -t astm3__modality_importance_rot_class_accuracy
f0afc88
astm3__multimodal_classification_clip
harbor run -d replicationbench@1.0 -t astm3__multimodal_classification_clip
f0afc88
astm3__photometry_classification_accuracy_no_clip
harbor run -d replicationbench@1.0 -t astm3__photometry_classification_accuracy_no_clip
f0afc88
astm3__photometry_classification_accuracy_with_clip
harbor run -d replicationbench@1.0 -t astm3__photometry_classification_accuracy_with_clip
f0afc88
astm3__spectra_classification_accuracy_limited_data_10_percent
harbor run -d replicationbench@1.0 -t astm3__spectra_classification_accuracy_limited_data_10_percent
f0afc88
astm3__spectral_similarity_search
harbor run -d replicationbench@1.0 -t astm3__spectral_similarity_search
f0afc88
bayes_cal__cold_hot_tandem
harbor run -d replicationbench@1.0 -t bayes_cal__cold_hot_tandem
f0afc88
bayes_cal__cold_temp
harbor run -d replicationbench@1.0 -t bayes_cal__cold_temp
f0afc88
bayes_cal__evidence
harbor run -d replicationbench@1.0 -t bayes_cal__evidence
f0afc88
bayes_cal__hot_temp
harbor run -d replicationbench@1.0 -t bayes_cal__hot_temp
f0afc88
bayes_cal__load_cal
harbor run -d replicationbench@1.0 -t bayes_cal__load_cal
f0afc88
bayes_cal__nwp_set
harbor run -d replicationbench@1.0 -t bayes_cal__nwp_set
f0afc88
chandra_representation__2dae_embedding
harbor run -d replicationbench@1.0 -t chandra_representation__2dae_embedding
f0afc88
chandra_representation__2dpca_embedding
harbor run -d replicationbench@1.0 -t chandra_representation__2dpca_embedding
f0afc88
chandra_representation__blackbody_spectral_fit
harbor run -d replicationbench@1.0 -t chandra_representation__blackbody_spectral_fit
f0afc88
chandra_representation__powerlaw_spectral_fit
harbor run -d replicationbench@1.0 -t chandra_representation__powerlaw_spectral_fit
f0afc88
disk_ridges__gaia_dr2_all
harbor run -d replicationbench@1.0 -t disk_ridges__gaia_dr2_all
f0afc88
disk_ridges__gaia_dr2_rvs
harbor run -d replicationbench@1.0 -t disk_ridges__gaia_dr2_rvs
f0afc88
disk_ridges__peak_mean_vz_all
harbor run -d replicationbench@1.0 -t disk_ridges__peak_mean_vz_all
f0afc88
disk_ridges__ridge_slope
harbor run -d replicationbench@1.0 -t disk_ridges__ridge_slope
f0afc88
disk_ridges__ridges_in_all
harbor run -d replicationbench@1.0 -t disk_ridges__ridges_in_all
f0afc88
eht_resolve__eht_reconstruction
harbor run -d replicationbench@1.0 -t eht_resolve__eht_reconstruction
f0afc88
eht_resolve__eht_ring_orientation_angle
harbor run -d replicationbench@1.0 -t eht_resolve__eht_ring_orientation_angle
f0afc88
eht_resolve__eht_ring_size
harbor run -d replicationbench@1.0 -t eht_resolve__eht_ring_size
f0afc88
eht_resolve__eht_ring_width
harbor run -d replicationbench@1.0 -t eht_resolve__eht_ring_width
f0afc88
galaxy_manifold__data_preparation
harbor run -d replicationbench@1.0 -t galaxy_manifold__data_preparation
f0afc88
galaxy_manifold__evolution_tracks
harbor run -d replicationbench@1.0 -t galaxy_manifold__evolution_tracks
f0afc88
galaxy_manifold__gas_mass_estimation
harbor run -d replicationbench@1.0 -t galaxy_manifold__gas_mass_estimation
f0afc88
galaxy_manifold__manifold_plane
harbor run -d replicationbench@1.0 -t galaxy_manifold__manifold_plane
f0afc88
galaxy_manifold__manifold_recovery
harbor run -d replicationbench@1.0 -t galaxy_manifold__manifold_recovery
f0afc88
galaxy_manifold__morphological_classification
harbor run -d replicationbench@1.0 -t galaxy_manifold__morphological_classification
f0afc88
galaxy_manifold__physical_properties
harbor run -d replicationbench@1.0 -t galaxy_manifold__physical_properties
f0afc88
galaxy_manifold__property_prediction
harbor run -d replicationbench@1.0 -t galaxy_manifold__property_prediction
f0afc88
galaxy_manifold__svd_analysis
harbor run -d replicationbench@1.0 -t galaxy_manifold__svd_analysis
f0afc88
galaxy_manifold__transformation_matrix
harbor run -d replicationbench@1.0 -t galaxy_manifold__transformation_matrix
f0afc88
galaxy_soptics__bcg_identification
harbor run -d replicationbench@1.0 -t galaxy_soptics__bcg_identification
f0afc88
galaxy_soptics__clustering_hyperparameter_optimization
harbor run -d replicationbench@1.0 -t galaxy_soptics__clustering_hyperparameter_optimization
f0afc88
galaxy_soptics__fof_optimization_sdss
harbor run -d replicationbench@1.0 -t galaxy_soptics__fof_optimization_sdss
f0afc88
galaxy_soptics__millennium_data_extraction
harbor run -d replicationbench@1.0 -t galaxy_soptics__millennium_data_extraction
f0afc88
galaxy_soptics__nyu_vagc_processing
harbor run -d replicationbench@1.0 -t galaxy_soptics__nyu_vagc_processing
f0afc88
galaxy_soptics__shi_catalog_acquisition
harbor run -d replicationbench@1.0 -t galaxy_soptics__shi_catalog_acquisition
f0afc88
galaxy_soptics__soptics_implementation
harbor run -d replicationbench@1.0 -t galaxy_soptics__soptics_implementation
f0afc88
galaxy_soptics__soptics_validation_shi
harbor run -d replicationbench@1.0 -t galaxy_soptics__soptics_validation_shi
f0afc88
gw_cosmo__dark_energy
harbor run -d replicationbench@1.0 -t gw_cosmo__dark_energy
f0afc88
gw_cosmo__h0_scaling
harbor run -d replicationbench@1.0 -t gw_cosmo__h0_scaling
f0afc88
gw_cosmo__measure_combo
harbor run -d replicationbench@1.0 -t gw_cosmo__measure_combo
f0afc88
gw_cosmo__modified_gravity
harbor run -d replicationbench@1.0 -t gw_cosmo__modified_gravity
f0afc88
gw_nsbh__default_mbh
harbor run -d replicationbench@1.0 -t gw_nsbh__default_mbh
f0afc88
gw_nsbh__default_mtov
harbor run -d replicationbench@1.0 -t gw_nsbh__default_mtov
f0afc88
gw_nsbh__equal_mass_slope
harbor run -d replicationbench@1.0 -t gw_nsbh__equal_mass_slope
f0afc88
gw_nsbh__load_data
harbor run -d replicationbench@1.0 -t gw_nsbh__load_data
f0afc88
gw_nsbh__mass_gap
harbor run -d replicationbench@1.0 -t gw_nsbh__mass_gap
f0afc88
gw_nsbh__mass_gap_constraint
harbor run -d replicationbench@1.0 -t gw_nsbh__mass_gap_constraint
f0afc88
gw_nsbh__mtov_spin
harbor run -d replicationbench@1.0 -t gw_nsbh__mtov_spin
f0afc88
gw_nsbh__mtov_spin_2
harbor run -d replicationbench@1.0 -t gw_nsbh__mtov_spin_2
f0afc88
gw_nsbh__spin_constraint
harbor run -d replicationbench@1.0 -t gw_nsbh__spin_constraint
f0afc88
hubble_trails__classifier_performance
harbor run -d replicationbench@1.0 -t hubble_trails__classifier_performance
f0afc88
hubble_trails__satellite_chance_post2020_acis
harbor run -d replicationbench@1.0 -t hubble_trails__satellite_chance_post2020_acis
f0afc88
hubble_trails__satellite_chance_post2020_uvis
harbor run -d replicationbench@1.0 -t hubble_trails__satellite_chance_post2020_uvis
f0afc88
hubble_trails__satellite_chance_pre2020_acis
harbor run -d replicationbench@1.0 -t hubble_trails__satellite_chance_pre2020_acis
f0afc88
hubble_trails__satellite_chance_pre2020_uvis
harbor run -d replicationbench@1.0 -t hubble_trails__satellite_chance_pre2020_uvis
f0afc88
hubble_trails__satellite_fractions
harbor run -d replicationbench@1.0 -t hubble_trails__satellite_fractions
f0afc88
hubble_trails__satellite_fractions_increase
harbor run -d replicationbench@1.0 -t hubble_trails__satellite_fractions_increase
f0afc88
lensing_dr6_growth__alens
harbor run -d replicationbench@1.0 -t lensing_dr6_growth__alens
f0afc88
lensing_dr6_growth__params
harbor run -d replicationbench@1.0 -t lensing_dr6_growth__params
f0afc88
ls_cal__antenna_temp
harbor run -d replicationbench@1.0 -t ls_cal__antenna_temp
f0afc88
ls_cal__cab_temp
harbor run -d replicationbench@1.0 -t ls_cal__cab_temp
f0afc88
ls_cal__cold_sparam
harbor run -d replicationbench@1.0 -t ls_cal__cold_sparam
f0afc88
ls_cal__hot_temp
harbor run -d replicationbench@1.0 -t ls_cal__hot_temp
f0afc88
ls_cal__nwp
harbor run -d replicationbench@1.0 -t ls_cal__nwp
f0afc88
mars_clouds__dbscan_optimization
harbor run -d replicationbench@1.0 -t mars_clouds__dbscan_optimization
f0afc88
mars_clouds__dbscan_test
harbor run -d replicationbench@1.0 -t mars_clouds__dbscan_test
f0afc88
muse_outflows__dust_reddening
harbor run -d replicationbench@1.0 -t muse_outflows__dust_reddening
f0afc88
muse_outflows__electron_density
harbor run -d replicationbench@1.0 -t muse_outflows__electron_density
f0afc88
muse_outflows__narrow_and_broad_line_decomposition_for_j080427
harbor run -d replicationbench@1.0 -t muse_outflows__narrow_and_broad_line_decomposition_for_j080427
f0afc88
muse_outflows__outflow_energetics
harbor run -d replicationbench@1.0 -t muse_outflows__outflow_energetics
f0afc88
muse_outflows__voronoi_binning_for_emission_lines_j080427
harbor run -d replicationbench@1.0 -t muse_outflows__voronoi_binning_for_emission_lines_j080427
f0afc88
trgb_std_candle__aseq_bseq_trgb
harbor run -d replicationbench@1.0 -t trgb_std_candle__aseq_bseq_trgb
f0afc88
trgb_std_candle__fit_aseq_bseq
harbor run -d replicationbench@1.0 -t trgb_std_candle__fit_aseq_bseq
f0afc88
trgb_std_candle__gaia_synthetic_i_trgb
harbor run -d replicationbench@1.0 -t trgb_std_candle__gaia_synthetic_i_trgb
f0afc88
trgb_std_candle__med_color_amp
harbor run -d replicationbench@1.0 -t trgb_std_candle__med_color_amp
f0afc88
ver_waves__gaia_breathing_typical
harbor run -d replicationbench@1.0 -t ver_waves__gaia_breathing_typical
f0afc88
ver_waves__gaia_rv_sample_size
harbor run -d replicationbench@1.0 -t ver_waves__gaia_rv_sample_size
f0afc88
ver_waves__solar_height_from_gaia_dr2
harbor run -d replicationbench@1.0 -t ver_waves__solar_height_from_gaia_dr2
f0afc88
ver_waves__sun_height_corrected
harbor run -d replicationbench@1.0 -t ver_waves__sun_height_corrected
f0afc88