Smash The Code - Feedback & Strategy

Wildcat-SC · May 9, 2016, 11:42am

For the 3 first days, I used an exhaustive 4 levels exploration (once for the points max, once for the heuristic max)

Then, I switched to a weighted Monte-Carlo tree favoring central positions and children max points. I forgot to weight the children points with the deepness, so it was not very effective

On the last days, I added an attack / defense system:

Exhaustive 3 levels exploration for P2 board
Attack was just selecting first 3 skull lines combo, or wait to get a bigger if there was no danger
Defense was trying to do a lower combo, if it had the ability to disturb a bigger P2 combo for more than 3 turns

[Edit] Yesterday, I tried to rewrite it completely to simulate a turn based strategy game with a real MCTS, but it never reached the RC state ^^

TylerDurden · May 9, 2016, 2:32pm

Hello ^^ 1st thing, Thx, was fun

League system=> nice ^^ have to be improved of course, but for a first time, was fine managed ^^ (at the beginnig, the step between bronze and silver was to high, but after the change middle week, was much better ^^)
Game itself=> cool, but i think that scoring must change when you will put it on multiplayer, or will ended with lot of ppl doing only tie fight, both finding the best combo fast… i think that just make “harder” to one shoot the other will help to evade that… (or maybe do 15-20 row…)

=> my strat=> Until legend was up ->Full scan 2 turns, me and opponent -> to find best combo at each turn, and best move for me at turn 1 to combo this turn, next, or later… and so, choosing carrefully ;p was enough for legend…

(but was determining “blocking” combo move, best strat concerning color, “Danger” value regarding space available for both of us,skulls already on tab… etc…)

When legend up, i begin to check later turns, taking my x best move, computing 22 next for all… selecting back my x best ont this x*22 moves, and go… i don’t like random ;p

But long WE, havn’t got the time to do it proper, and test how much i can up x to improve result without timeout… find some particular case where my choice was realy bad, but hard to redo against opponent using random ;p (and “not-to-good” opponent to let me the time to reach this choice ;p )

Pushed with x=15, crossed finger, and finished only 126… hard wait until i can correct it to test on multiplayer^^

Neumann · May 9, 2016, 6:32pm

Hi,

#31 here. I didn’t find inspiration during the contest, but here’s my strategy anyway :

First, I simulate every combination of pieces the opponent can play at depth=3 (it takes between 1 and 4 ms) and selects the best one with the formula : 5turn0 + 3turn1 + 1*turn3. I save this sequence of 3 pieces for later.

Like a lot of other contestants, I chose Monte Carlo to select the optimal play.
I tried to implement MCTS but I had performance issues (it requires a lot of board copies, and I had timeout issues related to the JVM) so I settled for a full random Monte Carlo (eww).

In order to limit the size of the tree to explore, my AI decides beforehand if it’s worth it or not to trigger an explosion at this very turn. If the sum of the three opponent plays that I previously chose (this with a 1 factor for every piece) is less than 1/4 of my board empty cells *70, then my AI will remove every piece that triggers an explosion from the possible plays for the first turn. It won’t even check if those pieces could trigger a BIG combo or not, it just ignores them …

I then generates and simulates as many sequences of 8 pieces as I can until I reach 100ms.

For the evaluation of the sequences, my AI computes a dynamic patience ratio. It’s based on the opponent score sequence I selected at the beginning, and basically scales with it :
I start with a 10 ratio, and for each turn, I reduce my ratio by (enemy_score_at_turn_X / score_that_kills_me). After turn 3 (I have no more vision on enemy score at this point), ratio is reduced by 2 at each turn. These dynamics ratio allowed my AI to favor combos that occurs before ‘the enemy potential shitstorm’.

I also keep track of the best sequences I get and reuse them for the next turn (I just add 1 more random piece at the end). I was able to simulate between 100 and 300k single-piece plays in 100ms.

Nothing too fancy but a descent simple AI.

R4N4R4M4 · May 9, 2016, 8:52pm

Hi, this challenge was very fun, good job !

My strategy is built on many many many parameters, and in my last submit, I changed just one which finally was not a good idea. The main questions were how to build colors to maximize the points potential, and when attack the opponent player.

Deep exploration here is only necessary to build the colors chain because deciding attack strategy with deep search would have made informations collect very complicated due to rules of the game and lack of data given about opponent actions and game status. So attack decision will depend only on points potential on the self current turn and the enemy next turn.

Also, I think game evaluation here is very important because potential color chains may depends on every move even those made in the beginning. The main criterias will be to keep an open game to maintain a lot of possibility for moves, which include maintaining space, playing in the center of the board, avoiding color closing when there is only on or two, and if possible, verifying the chain connections, which consists in evaluating the potential four or more colors groups. The other reason I think evaluation is very important is that every move can close opportunities in the future, even in the far end of the game.

So the key here will be knowing if a move is closing the game, opening it, and if it is building 3 colors groups connected with potential 4 or more colors groups.

The first part of the game, I used DFS, only on centered columns :

Number of turns for this first part : 8,
Deep : 4
First column : 1
Last column +1 : 5 (so last column is 4)
Evaluation : f1 * points + f2 * cumul-points + f3 * eval_position with f1, f2, f3 = 0, 1.2, 1

I tried to go deeper than 4 with a random rule based on proba decreasing with deep to not open some moves in exploration, but it was not very efficient, so I didn’t keep it (I didn’t had time to learn and implement MCTS) :

Probability to explore bad moves : 200 / ( deep * 1000)
Probability to explore good moves : 2000 / ( deep * 100)

I thought here that randomly eliminate moves was not a good idea for me for many reasons :

Every single move can potentially be a good move for the future destruct combo
Every move is strongly different from another, and can be very good or very bad in the far future
There is not a lot of possible moves (22) so eliminate one is very risky.
deep evaluation is mainly present in the game construction itself (like destruct chain), so time is better used in game evaluation of every early move than gaining deep in some random future move.
tree search algos is not the best skill I can improve in limited time

The second part of the game (after the 8th round), I used DFS too, but on all columns :

deep : 3
evaluation : same as before (see next parts for detail)

Each turn, I looked at the best enemy move in points on deep 1 and deep 2
Each turn I looked at my best move in points, and decided to strike based on some criterias :

Normal strike : points > 2300
Skull strike (if there are too much skulls) : points > 420 if at least 40 skulls
Space strike (if there is no more space : points > 420 if space left < 30 free cells
Fatal strike (chance to kill the enemy) if points >= (enemy space +1)* 70 : on / off
Defense strike : if enemy points > 2300 this move or next, and my points > 700

To build the game with chains of color :

Keep a free cell next to colors with bonus given : +10 (1 color alone) +50 (2 same colors) +150 (3 same colors)
and with malus if no free cell : -5 (1 color alone) -2(2 same) -1(3 same)
multiply this bonus + malus with a factor depending on number and proximity of the same colors in the AB colors queue
Adding a bonus for 3 same color groups to avoid bad moves destroying only 4 colors : +150
To compact same colors together :
Align colors on specific column: malus of - 4 * (color - column) if color - column > 1
(factor -8 here in last submit was the error because I only tested it in IDE and -4 was my best value balance during all my submits :(()
Add a malus for height : -2 * height
Malus if color is not on center : -30 * distance from center
Malus if a same color are in too far columns : -2 * width since witdh > 2
Verifying combo, comparing the 3 colors above each group of 3 with bottom, left and right colors : +50 for each different color found making a group of 4 or more if the group of 3 is destroyed

other stuff :

Malus to kill skulls based on height of skull : -10 * height * height
Malus to preserve space : -100 since space < 24
and many other things I finally didn’t keep,…

So, last 2 days were about… how combine all that ? That was my big problem…
My last error was changing one parameter at 7:56 PM in my last submission, although I had found a nice balance during all these days. Too curious last 4 minutes (rank 63).

To improve my parameters choice, I think I would have simulate them with a local simulator, because the color construction is very independent from the play strategy with the opponent. I didn’t got the idea but in fact I think that it is the best thing to do in this type of game.

This challenge was very interesting. The league system is a very good idea. Thanks to all for your good job.

anst · May 10, 2016, 11:28am

Great contest, again!
I was surprised how well gauged was the system of leagues, at least from the middle of the week.
On the other hand, it really felt need for speed – only 3 players with interpreted languages in the top 100.

And since performances are always discriminating, here goes my proposition.
Would it be possible to use Numba for accelerating Python?
This is a great LLVM-based JIT; works as a simple python module and… “a single function decorator results in a 1300x speedup of simple Python code”. Incredible as it sounds, but have a look at this.

Anther option would be a standard Cython addition, available in many distros. In short, it’s well-known compiler for Python that gives you C-like performances.

Any of these two should be not difficult to install and give real benefits for those who would like to use an “easy” language for contest.

pb4 · May 10, 2016, 7:27pm

For the people interested, here is a write-up of how I approached the contest.

As for the exact heuristics used in the “Board Quality”, they were very similar to those previously described by ranarama69.

Bandersnatch · May 10, 2016, 9:39pm

Hi,
I really appreciated the league system, it was way faster to be ranked after submission

For my strategy (16th, 1rst C#) :

Pure Branch & Bound for exploration with good metrics, no random : not that much nodes explored (~20k), but efficient. I memorized the tree each turn, and taken into account the most likely opponent’s move. I also simplified the tree for special cases and forgot a big one, just realized it monday along with the bug that cost me 40ms each turn ^^. Can’t wait for the multi game to be published so I can fix this xD

int33h · May 11, 2016, 7:17pm

I used bruteforce for the first 3 hints and then a genetic algorithm for the hints 4-8 choosing the best moves for each depth. The same strategy was used for inspecting enemy grid. Once you have the numbers it comes to deciding how greedy should you be in regards to getting more points and postponing your attack at the risk of being attacked by the enemy and possibly lose your whole board advantage.

I was ignoring potential enemy attacks with one line of skulls, focusing only on the 2+ ones. Also was disregarding deeper solution with more points if that wasn’t worth it, eg getting 450 points at step 6 is not that good as getting 400 points at step 3 but getting 3000 points on the other hand may be worth it.

When detecting a potential dangerous attack of 2 skulls or more I was switching from patience mode to counter-attack mode where I tried to score a number of points enough to add at least one line of skulls to enemy’s grid in fewer steps than the enemy would attack.

Unfortunately I finished my main algorithm in the last day of the contest and didn’t have time to play with the attack/wait parameters.

Final rank 251 - just 4 steps away from legend league

smeyer · May 13, 2016, 3:57pm

On the Coders Strike Back contest there was a Report button on the LeaderBoard page that let you see the code you submitted for the contest.

Is there a way to see my submitted code for this contest?

aLee2 · May 13, 2016, 4:52pm

you should have received an email with exactly the same kind of report, including you final submission… at least I did on 9th may

_CG_SaiksyApo · May 13, 2016, 4:57pm

Go to: https://www.codingame.com/leaderboards/global/challenge/smash-the-code/

Then replace :
/leaderboards/global/challenge/coders-strike-back/XXX -> /challengereport/XXX

smeyer · May 13, 2016, 6:24pm

Thanks that worked great.