[Question] DPO Implementation

For those who have implemented DPO: when you start the training process, do your reference model and your policy model need to have different initializations? The reason I ask is that if they start as the same model, the policy-to-reference log-prob ratios will be 0, which seems like it would make the initial loss 0 and prevent any update from occurring. Am I missing something?
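
One quick way to check the premise is to evaluate the standard DPO objective (from the Rafailov et al. paper) with identical policy and reference models. Here's a minimal PyTorch sketch; `beta` and the per-sequence log-probs are made-up placeholders, not values from any real run. With identical models the log-ratios are 0, so the loss starts at -log sigmoid(0) = log 2 rather than 0, and the gradient is nonzero:

```python
import torch
import torch.nn.functional as F

beta = 0.1  # placeholder KL-penalty strength

# Hypothetical per-sequence log-probs. With identical initializations,
# each policy log-prob equals its reference counterpart exactly.
policy_chosen = torch.tensor([-42.0], requires_grad=True)
policy_rejected = torch.tensor([-57.0], requires_grad=True)
ref_chosen = torch.tensor([-42.0])
ref_rejected = torch.tensor([-57.0])

# Standard DPO loss: -log sigmoid(beta * ((pi_w - ref_w) - (pi_l - ref_l)))
logits = beta * ((policy_chosen - ref_chosen)
                 - (policy_rejected - ref_rejected))  # 0 at init
loss = -F.logsigmoid(logits).mean()
loss.backward()

print(loss.item())                  # log(2) ~= 0.693, not 0
print(policy_chosen.grad.item())    # -beta/2 = -0.05 (descent raises chosen)
print(policy_rejected.grad.item())  # +beta/2 = +0.05 (descent lowers rejected)
```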

I don't think there's a distinction. Neither title is well defined, though, so who knows.

MLE is a very poorly defined label, so that's understandable. It can cover anything from someone implementing ML infrastructure to an applied researcher. I would broadly classify MLEs as the people who implement machine learning for a product/feature. This usually entails the whole life cycle: preparing the data, choosing a model, designing a system/software around it, training the model, and integrating it into a product. Thus, they need to be proficient in ML, understand the field and the state of the art, and also have some SWE skills. In my opinion, a master's is a really good level of education for this type of role.

I'm confused about what your point is. The people who are qualified will get the jobs, and the people who aren't won't. Most of the former will have graduate degrees. Even the "machine learning engineers" will (and why are you bashing this role while claiming not to know much about ML? It still requires solid depth of knowledge and the ability to understand current research).

It seems like the first halves of the two courses are pretty similar (topic-wise at least; I can't speak to the difficulty/depth of the problem sets).

The second half of 4220 (topics in optimization) seems a bit more practical than the second half of 6210.

I haven't taken 4220, but after looking at the syllabus I think I'd recommend it over 6210 for most people, tbh. Also, 6210 is very assignment-based when Damle is teaching it.

Yeah, I was just saying that because rigorous academics may hurt the "college experience" aspect since people are so busy studying.

Looking over the syllabus of 4220, I would say not a lot unless you want to do research in NLA.

Given the classified nature of defense work, the fact that this is a US government board, and the increased importance of ML in defense these days, it makes a lot of sense to me.

Two examples of what I believe is the SOTA multimodal pretraining technique are the LLaVA paper and the Qwen-Audio paper. Essentially, they freeze the LLM during pretraining and train an encoder that maps the non-text inputs into the frozen LLM's input space. Then the LLM is finetuned on multimodal instructions. This way the LLM can "understand" multimodal data without forgetting its text understanding.
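
Not pulled from either paper verbatim, but here's a toy PyTorch sketch of that first-stage recipe. The dimensions, the tiny transformer standing in for a pretrained LLM, and the random features are all made up; the point is just that only the projector trains while the LLM stays frozen:

```python
import torch
import torch.nn as nn

class Projector(nn.Module):
    """Maps modality-encoder features (e.g., image/audio embeddings)
    into the frozen LLM's token-embedding space. In this sketch it is
    the only trainable piece during stage-1 pretraining."""

    def __init__(self, feat_dim: int, llm_dim: int):
        super().__init__()
        self.proj = nn.Linear(feat_dim, llm_dim)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        return self.proj(feats)


def freeze(module: nn.Module) -> nn.Module:
    for p in module.parameters():
        p.requires_grad = False
    return module


llm_dim, feat_dim = 64, 32
# Stand-in for a pretrained decoder: any module consuming a
# (batch, seq, llm_dim) embedding sequence works for this demo.
llm = freeze(nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=llm_dim, nhead=4, batch_first=True),
    num_layers=2,
))
projector = Projector(feat_dim, llm_dim)

image_feats = torch.randn(1, 16, feat_dim)  # from a frozen modality encoder
text_embeds = torch.randn(1, 8, llm_dim)    # from the LLM's embedding table

# Prepend projected "soft tokens" to the text embeddings, run the LLM.
inputs = torch.cat([projector(image_feats), text_embeds], dim=1)
out = llm(inputs)

# Only the projector receives gradients; the LLM stays frozen.
out.sum().backward()
print(projector.proj.weight.grad is not None)  # True
print(next(llm.parameters()).grad is None)     # True
```

The instruction-tuning stage mentioned above would then unfreeze (some of) the LLM and train on multimodal instruction data.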

CS 5414 is a huge time sink, so proceed with caution if the rest of your schedule is tough. The content seems more geared toward someone looking to do large-scale distributed systems R&D, so I'm not sure how useful it is to a SWE or someone in quant finance. I work in an ML role, not in the above fields, so I could definitely be wrong about that.

I would recommend something like 5220 (HPC) over 5414.

6210 is another class I enjoyed, but probably better for people looking to solidify their linear algebra skills.

Hi,

I'm looking to hire someone to help with audio captioning. The ideal candidate has extensive audio engineering and music theory knowledge. If you're interested, please message me so we can talk compensation, captioning directions, volume of data (this is flexible), and your qualifications. Due to the nature of the task, you'll also get free samples if you'd like.

Wildflower preserve parking is the main spot.

Would definitely be interested in seeing some recent learning theory work here.