Java Environment Examples

Programming language: Java

Namespace/Package: com.ideaheap.mdp.sim

Class/Type: Environment

Examples at hotexamples.com: 2

Java Environment - 2 examples found. These are the top-rated real-world examples of com.ideaheap.mdp.sim.Environment extracted from open source projects. You can rate the examples to help us improve their quality.

Frequently used methods

getRewardForPos(2)

isTerminalState(1)

Example #1

File: Utility.java Project: nwertzberger/ai-class

public void updatePolicy(Policy policy, double discount) {
    double change;
    double maxError = MAX_ERROR * (1 - discount) / discount;
    do {
        change = 0;

        // Bellman step
        Utility u = new Utility(this);
        for (int y = 0; y < rows; y++) {
            for (int x = 0; x < cols; x++) {
                Pos p = new Pos(x, y);
                double reward = env.getRewardForPos(p);
                if (env.isTerminalState(p)) {
                    util[y][x] = reward;
                } else {
                    util[y][x] = reward
                            + discount * policy.getAction(p).getProbableReward(p, u, env);
                }
            }
        }

        // get most changed
        for (int y = 0; y < rows; y++) {
            for (int x = 0; x < cols; x++) {
                double diff = Math.abs(util[y][x] - u.util[y][x]);
                if (diff > change) {
                    change = diff;
                }
            }
        }
    } while (change > maxError);
}
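The loop in Example #1 repeatedly applies a Bellman update for a fixed policy and stops once the largest per-cell change drops to MAX_ERROR * (1 - discount) / discount or below. The sketch below reproduces that structure in a self-contained form so it can be run on its own. Policy, Pos, Environment, getProbableReward, and the value of MAX_ERROR are not shown in the excerpt, so the sketch makes assumptions: plain arrays stand in for the environment, a hypothetical Move enum with deterministic moves replaces the expected-reward calculation, and MAX_ERROR is assumed to be 1e-6.

```java
import java.util.Arrays;

/** Sketch of iterative policy evaluation on a grid; not the project's actual code. */
final class PolicyEvaluationSketch {
    // Assumed value; the excerpt does not show how MAX_ERROR is defined.
    static final double MAX_ERROR = 1e-6;

    /** Hypothetical deterministic moves, standing in for the source's Policy/Action types. */
    enum Move {
        LEFT(-1, 0), RIGHT(1, 0), UP(0, -1), DOWN(0, 1);
        final int dx;
        final int dy;
        Move(int dx, int dy) { this.dx = dx; this.dy = dy; }
    }

    /** Evaluates a fixed policy; discount is expected to lie strictly between 0 and 1. */
    static double[][] evaluate(double[][] reward, boolean[][] terminal,
                               Move[][] policy, double discount) {
        int rows = reward.length;
        int cols = reward[0].length;

        // Start from the immediate rewards, as the Utility constructor does.
        double[][] util = new double[rows][];
        for (int y = 0; y < rows; y++) {
            util[y] = reward[y].clone();
        }

        double maxError = MAX_ERROR * (1 - discount) / discount;
        double change;
        do {
            change = 0;

            // Snapshot of the previous sweep, so every update reads old values.
            double[][] prev = new double[rows][];
            for (int y = 0; y < rows; y++) {
                prev[y] = util[y].clone();
            }

            for (int y = 0; y < rows; y++) {
                for (int x = 0; x < cols; x++) {
                    if (terminal[y][x]) {
                        util[y][x] = reward[y][x];
                    } else {
                        Move m = policy[y][x];
                        // Assumption: moving into a wall leaves the agent in place.
                        int nx = Math.max(0, Math.min(cols - 1, x + m.dx));
                        int ny = Math.max(0, Math.min(rows - 1, y + m.dy));
                        util[y][x] = reward[y][x] + discount * prev[ny][nx];
                    }
                    // Track the largest change seen in this sweep.
                    change = Math.max(change, Math.abs(util[y][x] - prev[y][x]));
                }
            }
        } while (change > maxError);
        return util;
    }

    public static void main(String[] args) {
        double[][] reward = {{-0.04, -0.04, 1.0}};
        boolean[][] terminal = {{false, false, true}};
        Move[][] policy = {{Move.RIGHT, Move.RIGHT, null}}; // terminal cell needs no move
        System.out.println(Arrays.deepToString(evaluate(reward, terminal, policy, 0.9)));
    }
}
```

On the one-row grid in main, with every non-terminal cell moving right and a discount of 0.9, the utilities should settle at roughly 0.734, 0.86, and 1.0 after three sweeps. A stochastic transition model, which getProbableReward presumably handles in the original, would replace the single prev[ny][nx] lookup with a probability-weighted sum over successor cells.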

Example #2

File: Utility.java Project: nwertzberger/ai-class

public Utility(Environment env) {
    this.rows = env.rows;
    this.cols = env.cols;
    this.env = env;
    util = new double[env.rows][env.cols];
    for (int y = 0; y < env.rows; y++) {
        for (int x = 0; x < env.cols; x++) {
            util[y][x] = env.getRewardForPos(new Pos(x, y));
        }
    }
}
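The constructor in Example #2 relies on only three things from Environment: the rows and cols fields and getRewardForPos. The Environment and Pos classes themselves are not shown on this page, so the sketch below uses hypothetical stand-ins (SketchEnvironment and a two-field Pos, both backed by plain arrays) to show how the two frequently used methods might be implemented and how the initial utility grid is filled from them.

```java
import java.util.Arrays;

/** Hypothetical grid position; the real Pos class is not shown in the source. */
final class Pos {
    final int x;
    final int y;
    Pos(int x, int y) { this.x = x; this.y = y; }
}

/** Hypothetical stand-in for com.ideaheap.mdp.sim.Environment, backed by arrays. */
final class SketchEnvironment {
    final int rows;
    final int cols;
    private final double[][] rewards;
    private final boolean[][] terminal;

    SketchEnvironment(double[][] rewards, boolean[][] terminal) {
        this.rows = rewards.length;
        this.cols = rewards[0].length;
        this.rewards = rewards;
        this.terminal = terminal;
    }

    /** Immediate reward for standing on the given cell. */
    double getRewardForPos(Pos p) { return rewards[p.y][p.x]; }

    /** True if the episode ends on the given cell. */
    boolean isTerminalState(Pos p) { return terminal[p.y][p.x]; }
}

final class UtilityInitSketch {
    /** Mirrors the constructor: each cell starts at its immediate reward. */
    static double[][] initialUtilities(SketchEnvironment env) {
        double[][] util = new double[env.rows][env.cols];
        for (int y = 0; y < env.rows; y++) {
            for (int x = 0; x < env.cols; x++) {
                util[y][x] = env.getRewardForPos(new Pos(x, y));
            }
        }
        return util;
    }

    public static void main(String[] args) {
        double[][] rewards = {{-0.04, 1.0}, {-0.04, -1.0}};
        boolean[][] terminal = {{false, true}, {false, true}};
        SketchEnvironment env = new SketchEnvironment(rewards, terminal);
        System.out.println(Arrays.deepToString(initialUtilities(env)));
    }
}
```

Indexing the arrays as [y][x] follows the util[y][x] convention used in both examples, with Pos constructed as (x, y).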