Questions tagged [reward]

Use this tag in the context of reward functions for machine learning and especially reinforcement learning.

Use this tag in the context of reward functions for machine learning and especially reinforcement learning.

66 questions
5
votes
1 answer

Payment using reward points not showing up on checkout in Magento Enterprise Edition

I am working on a project in Magento Enterprise Edition and need to use the reward points system in the site. I set up the reward points system in the admin panel and allowed customer with 260 reward points. But when I tried to do the checkout,…
Akash Pius
  • 316
  • 4
  • 14
3
votes
0 answers

Reward videos number of plays limit

I am developing one android game. I am planning to integrate a reward videos in my app. I am planing to go for AdMob mediation. Below is my question. User can watch a reward video to save his/her life in game. So s and when he/she out, I will show…
Sam
  • 2,972
  • 6
  • 34
  • 62
3
votes
0 answers

Determine if a user is pledging on my Patreon

so I'm setting up a Patreon for a game I'm working on, and one of the pledge tiers allows the user to access an in-game chat that nobody else can access (other than other members of that tier). My game's chat engine allows me to create chats that…
3
votes
2 answers

Qlearning - Defining states and rewards

I need some help with solving a problem that uses the Q-learning algorithm. Problem description: I have a rocket simulator where the rocket is taking random paths and also crashes sometimes. The rocket has 3 different engines that can be either on…
mrjasmin
  • 1,230
  • 6
  • 21
  • 37
3
votes
3 answers

How should your company sponsor programming certification

Say your company is willing to sponsor the fees for taking programming certification examination. What is the best aproach to implement this? I would say if the programmer passes the exam at the first try, it shoud be fully sponsored. If the…
uuɐɯǝʃǝs
2
votes
1 answer

Why is the mean reward per episode of my PPO and DQN decreasing over time?

I am training an RL agent to optimise dispatching in a job shop manufacturing system. My approach is based on this code: https://github.com/AndreasKuhnle/SimRLFab. It migrates the environment to a gymnasium environment and updates the Python version…
2
votes
1 answer

Discount reward in REINFORCE deep reinforcement learning algorithm

I'm implementing a REINFORCE with baseline algorithm, but I have a doubt with the discount reward function. I implemented the discount reward function like this: def disc_r(rewards): r = np.zeros_like(rewards) tsteps =…
LRD
  • 351
  • 3
  • 13
2
votes
2 answers

How create multiple reward video's in Unity application?

the last few days I am trying to implement reward video's (admob) in my Unity app. I want to have multiple rewards video's people can watch, with different types of rewards. I feel like I am close (maybe not at all), since I have it working almost…
2
votes
1 answer

Convergence of the Q-learning on the inverted pendulum

Hello I'm working on a total control of the cartpole problem (inverted pendulum). My aim is for the system to reach stability meaning all the states(x, xdot,theta and theta) should converge to zero. I am using q-learning with a reward function as…
2
votes
2 answers

Unity RewardAd function call more time not just once

I want a RewardAd in my game. When you watch video you get +10 score to your current score not your high-score. You have a 45 high-score and you are now at 37, so you watch video for +10 score and you have 47 high-score it's fine. But if you do it…
antal1208
  • 85
  • 1
  • 7
2
votes
2 answers

Training of chess evaluation function

I am about to write a chess engine based on reinforcement learning. I'd like to train an evaluation function and figure out what are the weights of the board's most important features. I'm not an expert of machine learning, I'm trying to learn from…
2
votes
2 answers

Q-learning (multiple goals)

i have just started to study Q-learning and see the possibilities of using Q-learning to solve my problem. Problem: I am supposed to detect a certain combination of data, i have four matrices that acts as an input to my system, i have already…
1
vote
1 answer

After the ethereum merge, how can I know the reward address..?

Before the Ethereum merge. The miner received the fee or reward, and the miner was known by looking at the json rpc function "eth_getBlockByNumber". Now, I know that people who participated in staking get a reward. Has the json-rpc api been added to…
Jmob
  • 49
  • 7
1
vote
2 answers

Is it possible to use Facebook Credits as Rewards?

I would like to use Facebook Credits as Rewards, e.g. if a user reaches a next level or a high-score he can earn Facebook Credits. I'am planning to buy on my own (for the app) FB credits and then handing them out as rewards. In the API documentation…
1
vote
2 answers

Reinforcement learning does nothing when using test forex data

I am experimenting with RL and I am trying to write an AI so it can learn to trade the Forex market. Here is my code below: from gym import Env from gym.spaces import Discrete, Box import numpy as np import random import pandas as pd from…
1
2 3 4 5