Preprocessing issue in Data Analysis of ttbar using DeepCSV

KAUSHIK_GUPTA · September 25, 2024, 6:15am

Hello Everyone,
I’m trying to learn the machine learning tool to discriminate b-tagging among c and light quark jets.
In the pre-processing, when I’m doing the following step:

Padding the missing value to zero.
Scaling the variable and then adding the offset to each feature.
I have also tried input_value = input_value - feature.mean().

All the information are collected from the github: RecoBTag-Combined/DeepCSV_PhaseI.json at master · cms-data/RecoBTag-Combined · GitHub

Data is taken from the latest:
/eos/cms/store/group/phys_btag/ParT_2024/ttbar_test

But their is discrepancy between the model predicted and actual jet tag.

Let me know if I’m missing anything in this.

Danilo · September 25, 2024, 6:48am

Hello,

Thanks for the post.
This looks more like a question for the CMSTalk forum: have you tried to post this on that platform?

Cheers,
D

system · October 9, 2024, 6:49am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.