Example #1
File: QLearning.java Project: amw8/rlpark
 // Watkins's Q(lambda): one off-policy TD control step, with eligibility
 // traces that are cut whenever the previous action was not the greedy one.
 public double update(RealVector x_t, Action a_t, RealVector x_tp1, Action a_tp1, double r_tp1) {
   if (x_t == null) return initEpisode();
   Action atp1_star = greedy.decide(x_tp1);
   RealVector phi_sa_t = toStateAction.stateAction(x_t, a_t);
   // TD error: r_tp1 + gamma * max_a Q(x_tp1, a) - Q(x_t, a_t)
   delta = r_tp1 + gamma * greedy.bestActionValue() - theta.dotProduct(phi_sa_t);
   if (a_t == at_star) e.update(gamma * lambda, phi_sa_t); // greedy step: decay traces, accumulate phi
   else {
     e.clear(); // exploratory step: cut the traces before accumulating phi
     e.update(0, phi_sa_t);
   }
   theta.addToSelf(alpha * delta, e.vect()); // theta <- theta + alpha * delta * e
   at_star = atp1_star;
   return delta;
 }
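
For readers without rlpark on the classpath, here is a minimal self-contained sketch of the same Watkins's Q(lambda) step using plain double arrays in place of rlpark's RealVector and Traces abstractions. The class name, the int-typed actions, and the block-wise phi(x, a) encoding are all illustrative choices, not rlpark's API.

import java.util.Arrays;

// Self-contained sketch of Watkins's Q(lambda) with linear function
// approximation; mirrors the rlpark snippet above, hypothetical names.
final class WatkinsQLambdaSketch {
  final double alpha, gamma, lambda;
  final int nbActions, nbStateFeatures;
  final double[] theta;      // weight vector over state-action features
  final double[] e;          // accumulating eligibility traces
  int lastGreedyAction = -1; // plays the role of at_star above

  WatkinsQLambdaSketch(int nbActions, int nbStateFeatures,
                       double alpha, double gamma, double lambda) {
    this.nbActions = nbActions;
    this.nbStateFeatures = nbStateFeatures;
    this.alpha = alpha;
    this.gamma = gamma;
    this.lambda = lambda;
    theta = new double[nbActions * nbStateFeatures];
    e = new double[theta.length];
  }

  // phi(x, a): the state features copied into the block reserved for action a.
  private double[] stateAction(double[] x, int a) {
    double[] phi = new double[theta.length];
    System.arraycopy(x, 0, phi, a * nbStateFeatures, nbStateFeatures);
    return phi;
  }

  // Q(x, a) = theta . phi(x, a), computed without materializing phi.
  private double q(double[] x, int a) {
    double v = 0;
    for (int i = 0; i < nbStateFeatures; i++)
      v += theta[a * nbStateFeatures + i] * x[i];
    return v;
  }

  int greedyAction(double[] x) {
    int best = 0;
    for (int a = 1; a < nbActions; a++)
      if (q(x, a) > q(x, best)) best = a;
    return best;
  }

  // One learning step; x_t == null marks the start of a new episode,
  // matching the convention of the rlpark snippet above.
  double update(double[] x_t, int a_t, double[] x_tp1, double r_tp1) {
    if (x_t == null) {
      Arrays.fill(e, 0);
      lastGreedyAction = -1;
      return 0;
    }
    int greedyNext = greedyAction(x_tp1);
    // TD error: r + gamma * max_a Q(x_tp1, a) - Q(x_t, a_t)
    double delta = r_tp1 + gamma * q(x_tp1, greedyNext) - q(x_t, a_t);
    // Watkins's trace cut: decay only if the last action was the greedy one.
    double decay = a_t == lastGreedyAction ? gamma * lambda : 0;
    double[] phi = stateAction(x_t, a_t);
    for (int i = 0; i < e.length; i++) {
      e[i] = decay * e[i] + phi[i];     // e <- decay * e + phi(x_t, a_t)
      theta[i] += alpha * delta * e[i]; // theta <- theta + alpha * delta * e
    }
    lastGreedyAction = greedyNext;
    return delta;
  }
}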
Example #2
File: QLearning.java Project: amw8/rlpark
 public QLearning(
     Action[] actions,
     double alpha,  // learning rate
     double gamma,  // discount factor
     double lambda, // trace-decay rate
     StateToStateAction toStateAction,
     int nbFeatures,
     Traces prototype) {
   this.alpha = alpha;
   this.gamma = gamma;
   this.lambda = lambda;
   this.toStateAction = toStateAction;
   greedy = new Greedy(this, actions, toStateAction);
   theta = new PVector(nbFeatures);     // weight vector, initialized to zero
   e = prototype.newTraces(nbFeatures); // clone fresh traces from the prototype
 }
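
A note on the last parameter: QLearning never names a concrete trace implementation; it clones fresh traces from whatever prototype it receives via newTraces(nbFeatures). Below is a minimal sketch of that idiom; the interface is inferred only from the calls visible in these snippets (newTraces, clear, update, vect), and all names are hypothetical, not copied from rlpark.

// Hypothetical illustration of the prototype idiom used by the constructor.
interface TracesSketch {
  TracesSketch newTraces(int size);        // factory: fresh traces of a given size
  void clear();                            // zero every trace
  void update(double decay, double[] phi); // traces <- decay * traces + phi
  double[] vect();                         // the underlying trace vector
}

final class AccumulatingTracesSketch implements TracesSketch {
  private final double[] values;

  AccumulatingTracesSketch() { this(0); } // size-0 prototype, used only for cloning
  AccumulatingTracesSketch(int size) { values = new double[size]; }

  public TracesSketch newTraces(int size) { return new AccumulatingTracesSketch(size); }
  public void clear() { java.util.Arrays.fill(values, 0); }
  public void update(double decay, double[] phi) {
    for (int i = 0; i < values.length; i++) values[i] = decay * values[i] + phi[i];
  }
  public double[] vect() { return values; }
}

Injecting a prototype this way lets a caller swap in a different trace variant, say replacing instead of accumulating traces, without touching the learner.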
Example #3
File: QLearning.java Project: amw8/rlpark
 private double initEpisode() {
   if (e != null) e.clear(); // reset the eligibility traces for the new episode
   delta = 0.0;
   at_star = null; // no previous greedy action: the first update will cut traces
   return delta;
 }
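
To see the episode-boundary convention end to end, here is a hypothetical driver for the WatkinsQLambdaSketch above: each episode begins with a null state, triggering the same reset that initEpisode performs. The two-state chain, the seed, and the epsilon of 0.1 are invented for illustration.

// Hypothetical driver: a two-state chain where action 1 ends the episode
// with reward +1 and action 0 stays in the start state with reward 0.
public final class ChainDemo {
  public static void main(String[] args) {
    WatkinsQLambdaSketch learner = new WatkinsQLambdaSketch(2, 2, 0.1, 0.99, 0.9);
    java.util.Random rng = new java.util.Random(0);
    for (int episode = 0; episode < 200; episode++) {
      learner.update(null, 0, null, 0); // x_t == null: same reset as initEpisode()
      double[] x = {1, 0};              // one-hot features of the start state
      boolean done = false;
      while (!done) {
        // epsilon-greedy action selection over the learner's own Q estimates
        int a = rng.nextDouble() < 0.1 ? rng.nextInt(2) : learner.greedyAction(x);
        double[] xNext = a == 1 ? new double[] {0, 1} : new double[] {1, 0};
        double r = a == 1 ? 1.0 : 0.0;
        done = a == 1;
        learner.update(x, a, xNext, r);
        x = xNext;
      }
    }
  }
}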