We consider an N-player multiarmed bandit game in which each player chooses one out of M arms for T turns. Each player has different expected rewards for the arms, and the instantaneous rewards are independent and identically distributed or Markovian. When two or more players choose the same arm, they all receive zero reward. Performance is measured using the sum of regrets, compared with an optimal assignment of arms to players that maximizes the sum of expected rewards. a...