Weights for the reward function during the model learning

Hi @jangirrishabh,
This repository is awesome and well-explained. I want to thank you for the great content and code. 
And I just have a question regarding the learning.py.
My question is: before training your NN model, you defined some weights in line: 
https://github.com/jangirrishabh/toyCarIRL/blob/2eff036e594a787299d1e4cc82e46f0f9b21308f/learning.py#L206
and fed them into the carmunk to get the immediate reward and the new state based on the taken action to update the Y vector in the mini-batch process method. I was wondering how you defined the weights (weights for the reward function). Because later, you use this trained model in the toy_car_IRL.py to update the policy and reconstruct the weights for the reward function. So do those weights affect the trained NN model or they are just some random values?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Weights for the reward function during the model learning #10

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Weights for the reward function during the model learning #10

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions