Analysis and control of stochastic systems


Markov decision theory

S. Mannor and J. N. Tsitsiklis, "Algorithmic Aspects of Mean-Variance Optimization in Markov Decision Processes", European Journal of Operational Research, Vol. 231, 2013, pp. 645-653.

J. N. Tsitsiklis, "NP-Hardness of Checking the Unichain Condition in Average Cost MDPs," Operations Research Letters, Vol. 35, No. 3, May 2007, pp. 319-323.

S. Mannor and J. N. Tsitsiklis, "On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies", Mathematics of Operations Research, Vol. 30, No. 3, August 2005, pp. 545-561.

D.P. Bertsekas and J.N. Tsitsiklis, "An Analysis of Stochastic Shortest Path Problems", Mathematics of Operations Research, Vol. 16, No. 3, August 1991, pp. 580-595.

G. H. Polychronopoulos and J. N. Tsitsiklis, "Stochastic Shortest Path Problems with Recourse", Networks, Vol. 27, No. 2, 1996, pp. 133-143.

H.N. Psaraftis and J.N. Tsitsiklis, "Dynamic Shortest Paths in Acyclic Networks with Markovian Arc Costs", Operations Research, Vol. 41, 1, 1993, pp. 91-101.

C.-S. Chow and J.N. Tsitsiklis, "An Optimal One-Way Multigrid Algorithm for Discrete-Time Stochastic Control", IEEE Transactions on Automatic Control, Vol. AC-36, No. 8, 1991, pp. 898-914.

C.-S. Chow and J.N. Tsitsiklis, "The Complexity of Dynamic Programming", Journal of Complexity, Vol. 5, No. 4, 1989, pp. 466-488. (errata)

C.H. Papadimitriou and J.N. Tsitsiklis, "The Complexity of Markov Decision Processses", Mathematics of Operations Research, Vol. 12, No. 3, 1987, pp. 441-450.

See also Neuro-Dynamic Programming


Stochastic approximation

V. R. Konda and J. N. Tsitsiklis, "Linear Stochastic Approximation Driven by Slowly Varying Markov Chains", Systems and Control Letters, Vol. 50, No. 2, 2003, pp. 95-102.

V. R. Konda and J. N. Tsitsiklis, "Convergence Rate of Linear Two-Time Scale Stochastic Approximation", Annals of Applied Probability, Vol. 14, No. 2, 2004,

D. P. Bertsekas and J. N. Tsitsiklis, "Gradient Convergence in Gradient Methods with Errors," SIAM Journal in Optimization, Vol. 10, No. 3, 2000, pp. 627-642.

J. N. Tsitsiklis, "Asynchronous Stochastic Approximation and Q-learning", Machine Learning, 16, 1994, pp. 185-202. Correction.

J. N. Tsitsiklis, D. P. Bertsekas and M. Athans, "Distributed Asynchronous Deterministic and Stochastic Gradient Optimization Algorithms," IEEE Transactions on Automatic Control, Vol. 31, No. 9, 1986, pp. 803-812.


Multi-armed bandit and search problems

P. Rusmevichientong, and J. N. Tsitsiklis, "Linearly Parameterized Bandits", Mathematics of Operations Research, Vol. 35, No. 2, May 2010, pp. 395-411. [extended version]

A. J. Mersereau, P. Rusmevichientong, and J. N. Tsitsiklis, "A Structured Multiarmed Bandit Problem and the Greedy Policy", IEEE Transactions on Automatic Control, Vol. 54, No. 12, December 2009, pp. 2787-2802; short version in the Proceedings of the 47th IEEE Conference on Decision and Control, Cancun, Mexico, December 2008.

S. Das and J. N. Tsitsiklis, "When is it Important to Know You've Been Rejected? A Search Problem with Probabilistic Appearance of Offers", Journal of Economic Behavior and Organization, Vol. 74, No. 1-2, 2010, pp. 104-122.

J. Sethuraman and J. N. Tsitsiklis, "Stochastic Search in a Forest Revisited," Mathematics of Operations Research, Vol. 32, No. 3, August 2007, pp. 589-593.

S. Mannor and J. N. Tsitsiklis, "The Sample Complexity of Exploration in the Multi-Armed Bandit Problem," Journal of Machine Learning Research, Vol. 5, June 2004, pp. 623-648.

D. Bertsimas, I. Paschalidis and J. N. Tsitsiklis, "Branching Bandits and Klimov's Problem: Achievable Region and Side Constraints" , IEEE Transactions on Automatic Control, Vol. 40, No. 12, December 1995, pp. 2063-2075.

J. N. Tsitsiklis, "A Short Proof of the Gittins Index Theorem", Annals of Applied Probability, Vol. 4, No. 1, 1994, pp. 194-199.

J. N. Tsitsiklis, "A Lemma on the Multi-Armed Bandit Problem", IEEE Transactions on Automatic Control, Vol. 31, No. 6, 1986, pp. 576-577.


Stochastic Scheduling and Inventory Control

D. Shah and J. N. Tsitsiklis, "Bin Packing with Queues," Journal of Applied Probability, Vol. 45, No. 4, December 2008, pp. 922-939.

N. Sabbaghi, Y. Sheffi, and J. N. Tsitsiklis, "Coordination capability of linear wholesale price contracts ", technical report LIDS-P-2749, Laboratory for Information and Decision Systems, MIT, February 2007.

A. Muharremoglu and J. N. Tsitsiklis, "Dynamic Leadtime Management in Supply Chains," unpublished manuscript, June 2003.

A. Muharremoglu and J. N. Tsitsiklis, "A Single-Unit Decomposition Approach to Multi-Echelon Inventory Systems", Operations Research, Vol. 56, No. 5, September-October 2008, pp. 1089-1103. Electronic companion (appendix).

B. Van Roy, D. P. Bertsekas, Y. Lee, and J. N. Tsitsiklis, "A Neuro-Dynamic Programming Approach to Retailer Inventory Management", November 1996. Short version in Proceedings of the 36th IEEE Conference on Decision and Control, San Diego, California, December 1997, pp. 4052-4057.

C.H. Papadimitriou and J.N. Tsitsiklis, "Stochastic Scheduling with In-Tree Precedence Constraints", SIAM Journal on Computing, Vol. 16, No. 1, 1987, pp. 1-6.

J.N. Tsitsiklis, "Periodic Review Inventory Systems with Continuous Demand and Discrete Order Sizes", Management Science, Vol. 30, No. 10, 1984, pp. 1250-1254.

J.N. Tsitsiklis, "Convexity and Characterization of Optimal Policies in a Dynamic Routing Problem", Journal of Optimization Theory and Applications, Vol. 44, No. 1, 1984, pp. 105-136.

G. D. Stamoulis and J. N. Tsitsiklis, "Optimal Distributed Policies for Choosing Among Multiple Servers", Proceedings of the 30th IEEE Conference on Decision and Control, Brighton, England, December 1991, pp. 815-820.


Simulated annealing

J.N. Tsitsiklis, "Markov Chains with Rare Transitions and Simulated Annealing", Mathematics of Operations Research, Vol. 14, No. 1, 1989, pp. 70-90.

J.N. Tsitsiklis, "A Survey of Large Time Asymptotics of Simulated Annealing Algorithms", in Stochastic Differential Systems, Stochastic Control Theory, and Applications, W. Fleming and P. L. Lions, (Eds.), Springer-Verlag, New York, 1988, pp. 583-599.

D. Bertsimas and J. N. Tsitsiklis, "Simulated Annealing", Statistical Science, Vol. 8, No. 1, 1993, pp. 10-15.


Computation with noisy gates

N. Pippenger, G.D. Stamoulis, and J.N. Tsitsiklis, "On a Lower Bound for the Redundancy of Reliable Networks with Noisy Gates", IEEE Transactions on Information Theory, Vol. IT-37, May 1991, pp. 639-643.