Fall Challenge 2020 - Feedbacks & Strategies

Magus · November 24, 2020, 7:32pm

I answer to a question here.

I received a PM from @semera

I respond here because the answer can be interesting for other people.

“Patience coefficient” is probably not the “school” term. I suppose it is a “custom codingame term” like many others. So sorry, I don’t know what you should write into google But I can explain what is it:

Let’s say you want to evaluate a path of your simulation. Take this 2 paths for example:

CAST 1 1 BREW 2 REST CAST 4 1
BREW 2 CAST 1 1 REST CAST 4 1

If you just evalute the game state at the end of the 4 turns, the score will be the same. But this is not what you want. You want your AI to take the second path, because a brew now is better than a brew later.

This is why we use a “patience coefficient”. If the brew reward 5 points (for example), the brew at depth 0 will reward 5 * PATIENCE_COEF^0 and the brew at depth 1 will reward 5 * PATIENCE_COEF^1.

You can use whatever patience coefficient between 0 and 1. A “common” patience coefficient I saw in many contests on codingame before is 0.7. For the Fall Challenge 2020, according to the differents PM, a good value seems to be between 0.95 and 0.99.

R4N4R4M4 · November 24, 2020, 7:43pm

Yes Magus, but if you see the opponent will brew the second potion (with +1 bonus) and you will brew the first (with +3 bonus), you would want to let him take this potion with +1 bonus and take yours the turn after, or at the same time, to not let him grab the +3 bonus… I agree, in this case, your eval would say it

reCurse · November 24, 2020, 8:35pm

It’s a discounted reward.

Agade · November 24, 2020, 10:12pm

I finished 2nd place and here is my postmortem.

Jahz · November 24, 2020, 10:42pm

I finished 240 using Java with a depth 4 simple MC with ~30K average unitary simulations done per turn.
Nothing fancy in there, but at least this time I managed to avoid objet allocations in the loops and escaped the GC’s timeout nightmare.

I precomputed during turn 1 the use of all spells (with all their repetitions) on any of the 1001 inventories, but this was probably not a good investment of my time since it didn’t led to any major performance gain.
I’m still amazed that for the “learn phase” I wasn’t able to find anything better than “learn first spell for first 8 turns”, especially when it’s really not the case in the boardgame version according to this, but that’s how it was.

Thanks to CG once again for a great challenge !

Congrats to my Amadeus teammates for reaching #1 in the company leader-board !

Congrats to the top 3 pb4 / Agade / reCurse (and thanks for sharing your PMs as always), it was fun to see you battling each other a bit like Tennis’ big 3 Federer/Nadal/Djokovic
Wish my bro Saelyos (the “nextgen” ?) could have competed with you, but it wasn’t to be the case this time !

Fangel · November 24, 2020, 10:57pm

Hi all !
First of all thanks to CG and all participants. I think this challenge is one of my favorites, as it was quite original, with nice graphics. I finished #53 in legend. I learnt a lot on this challenge and on the other hand, I really failed some parts of my AI and still wonder what to do. I have some regrets on the global efficiency of the matching… I suppose it is due to the great number of participants. But well, I had to wait more than 2 hours to finish my matches. In comparison to previous challenge, I felt to have done 5 times less matches. The backfeed on modifications was thus very slow…

My strategy was a beam search depth 15 width 200 evaluating on the score… Yes on the score because I did not manage to find better eval… Even spending hours trying ! I considered opponent only to predict a coming end (with depth 5). My learn was first basic (7 cheapest ones), but I tried to give values on different spells to see if buying a better one was efficient… Not sure I succeeded. So I won’t talk anymore of the heuristic as I really feel to have failed it.

So on what my AI was good ? clearly on data structure IMHO. For me, a node of the beam search contained 3 things:

the inventory (coded in [0-1000], as explained by other PM. Every inventory transformation (brew, cast, spell tax) was precomputed for this [0-1000] inventory states).
the achievement/readiness state on 32 bits: 5 for brew, 6 for learn, 21 for readiness (no need for more really).
the root move, which is only the first move of the series I propagate until end, so no need to backpath.

So the nice trick was to check if a state or better was already visited. I used BFS, so I solved depth after depth. A state is clearly better than another if it has the same inventory state and a better achievement/readiness state, that is to say at least the same bits at 1 (and possibly more). One can check it simply on A and B, bitarrays, if A&B==A, B is a better (or equal) state.

So I built a list of 1001 lists of the encountered achievement/readiness states. When I find a new element A, I go to the list of the inventory index and compare it to every element B in the list.
If I find B better than A, then I already crossed a better state, I break and trash A.
If I find A better than B, then I can break because B already passed the check before (so A would do it easily) and I replace B by A.
If I reach the end, I append A.

I first thought this check would kill my perfs. But finally it revealed to be very efficient, as A tends to improve with depth, the number of comparisons does not grow that much and the pruning is really worth. Well to be honest I found the final form of this algorithm on last sunday at 2.00 am and was not confident enough to submit it

Note on the root move that I always solved in order for a depth, all REST, then CAST, then LEARN, then BREW. So by construction, BREW > a > b > c always come before in the check than a > b > c > BREW, which prevents doing a uselessly risked REST > BREW.

Now some additional thoughts, this game was nearly solo-player. The main interaction in my opinion is on the 5 first turns when both players are buying learns at the time and at the end when one stops the game. Even the concurrence on the brews is really small. Most of the games in top legend last between 32 and 40 turns. And the one who ends generally wins. Even if he took smaller potions, it has the decision to end the game in his hands and the last move is necessarily a score step for him with the last potion.
When I did matches vs top 5, I generally lost with only 3-4 potions, and I really believe that they win because they are faster. I even won some matches vs them, when my AI was luckily fast and brewed the 6th before them. I tried some match vs the dummy AI (wood boss isn’t it ?). And yes don’t worry I won all of it, but I realized that the matches last between 38-48 turns. So yeah, there is only one ressource in this game (like many other), it’s time and winners just used it better, congratulations to them !

Zakaoai · November 25, 2020, 12:21am

My path to be top #1 … In silver

So hello dear reader, Zakaoai here !

My final ranking are :

#544 / 7011 Global
#1 / 1913 Silver
#9 / 592 JavaScript
#191 / 2730 French Player

Introduction

For me, this challenge was the challenge of Performance against the choice of your programmation language.

I choosed JavaScript but was it a good choice ? Certainly not but I choosed it anyway.

The challenge

This challenge was really enjoyable. The rules and objectives are simple but there was a tons of combinations possible. Futhermore, the challenge theme was my favourite. I’m a huge fan of Zelda’s series

I will now explain how I reached my final Rank #1 Silver

Wood

Well for the Woods ranks, I need only some few functions. The goal was to brew only 1 and 3 potions.

I made function for :

Dealing with my current inventory
Filter castable spells if I have the ingredients
Brew only the best potions in the list

Bronze

For the bronze league, I do some overkill functions that I thought at the begining of the challenge can be a good BFS but it wasn’t.

The main idea was to precalculate with my inventory all the self sustainable of castable spells.

With this 1st idea, I reach fastly the limit of JS performance and that was at this moment of the challenge where I ask some questions to the Community.

Thanks a lot for all your advices Mazelcop.

Bronze to Silver

So after all, I rework my first Idea by starting with 5 states ( 1 by potion) and search the best spell’s chain that can help me to brew the potion.

My “fake” BFS make up to chain of 5 spell’s chain and I can repeat it 3 times their combinations. After I got the list of all actions, I store it in a List and play these actions without checking the game state.

NB : don’t do that The game state update every turn. That’s why it’s a “fake” BFS.

But with some luck, I was pushed in Silver.

Silver Before Gold

So here I am the final League but there was multiple “state” of silver league because before Gold we were a lot of player in it.

An easy code rank #500 Global - Best Than Nothing

So since I obtain my silver rank, I decide to erase all my code and write a simple function I called it “Best Than Nothing”.

The main idea was : If we dont know what to do, what can be a costless function to play other actions than ‘WAIT’

For this I used for the 1st time and until my last submit the famous META call Learn 8 Turns which mean to learn only at the beginning of the game.

So for the algo of Best Than Nothing


If a potion can be BREW with my current inventory

    I BREW the potion

Else

    I check all spell that I can CAST with my current inventory

    I sort the by the gain of the cast with the idea of price / ingredient [1,2,3,4]

    I cast the best one

If I can't cast a spell, I use REST

With this I was top 500 for at least 1 day. It was a funny joke for me to be this good with such a random function that only focus on production of ingredient.

Silver After Gold + Legend

Then, Gold and Legend League appears. The best player did beat the Bossdorf but I didnt reach my main goal of Gold. At least I’m Top #1

Begin Simulations, BFS For the win ?

So with the code I made, I decided with the recommandation of the community to do an evaluation function.

Not a BFS at first only evaluate one turn of play to do always the best action.

Then, I transformed it into a BFS.

The algo was :


Begin with the Initial State ( Castable Spells List, Inventory)

Add this state into a list

While step below a defined depth (I can't go over 4 / 5 with Beam Search)

    For each states in the list search all child state ( using flatMap I can use only one list)

        Childs states :

            Each Castable spells with my inventory

            Each Brewable potion with my inventory

            Rest

    End For

End While

I try to do a Beam Search to upgrade the performance of my BFS but at the cost of lot of result. I can only achieve Depth 5 with Limitation of 50

Beam Search is a good solution when you know your evaluation of all nodes is good enougth to take only best nodes. And without good perfs Beam Search can’t achieve a good rank.

For this at the end of each For loop, I sort all node and take only the 50 best one. That also kill the perfs when you sort a lot of node. So don’t use sort to do that.

Upgrade Performance of the BFS

The next days after this 1st attempt of BFS was to do all I can for the performance of BFS in JavaScript.

Each day, I talked with community members and I read how many states others languages can check. Before my optimizations, I can only check 1000 states vs 20k after optimizations. Others players can go over 100k states, so it was really frustrating.

So here are all my tips to optimize your JS BFS by order of how I implemented them :

Pre Calculate all you can Turn 0 with all data you know ( even in the github project of the challenge)
- Sum of all possible inventory (1001 possible states) with all known actions
- Ratio of completion for all Potions with all inventory states
- All maths function we use multiple times like Math.pow(0.9,n)
Create an hashtable of states to duplicate Node (hash based on inventory state and spell state) See binary representation bellow for a nice exemple
Use some tips to optimize BFS for the garbage collector and without queue Coding Game Optimizing BFS
Use standard function over JS function even if bench dont say so ( JS BENCH sites check only execution performance without thinking of Garbage Collector) so replace map and foreach with simple for
Optimize all objects instanciations to minimize impact of Garbage Collector. Use global constant for every string or generic object.

Last Optimization get rid of duplicate array for spells and potions between state

It was the first time I deal with binary operations since I was in school. So it was like I learn again how to use them.

So my objectives behind these binary operations was to never instanciate arrays for all my states and only store a binary representation of those arrays.

So I need to store a global array of arrays of all indexs combination for spells and potions list.

Exemple :

With Learn 8 strat we have 12 spells

So we can think of thoses spells with 1 bit for each spell (0 -> used spell, 1 -> castable spell)

So IIIIIIIIIIIII -> All spells are up. the number is 4095 (2^(12) - 1)

With this, I associate to this number a list of index of castable spell only

IIIIIIIIIIIII -> 4095 -> [0,1,2,3,4,5,6,7,8,9,10,11,12]

IIIIIIIIIIII0 -> 4094 -> [1,2,3,4,5,6,7,8,9,10,11,12]

With this list in global constant, I finally get rid of array instanciation between states of my BFS when I BREW a potion or CAST a spell I only store a binary number.

For the inventory it was an other method. I used 4 bit by type of ingredient. So the math function behind is :


function hashInv(i, j, k, l) {

    return i + 16 * j + 256 * k + 4096 * l;

}

With both of those binary numbers, you can achieve a good Hash of your state. So for combination I did a binary sum of these 2 numbers with a little trick :


(invBinary << mySpells.length) + spellsBinary

I shift the binaryNumber of my inventory with the length of my spells array. You’ll have finally a unique number for all your states.

My Final BFS

Finally here it is. The final representation of my BFS

Node


State :

{

    action,

    myBinarySpells,

    binaryPotion,

    invHash,

    score 

}

Child Nodes

binaryPotion -> Brewable Potions

myBinarySpells -> Castable Spells

if myBinarySpells < pow(2,spell.length) -> Action Rest

Evaluation

Score only BrewablePotion :

Brewable Potion Price + Ratio * Price of the nearest Potion with the inventory after the brew.

Recommendation for future challenge

Don’t miss steps. Write ideas but dont rush to code them without thinking if it worth the time
Try to translate all game datas to simples objects. A list of castable spells can be seen as a binary number with some imagination.
Prepare your generic functions to deal with timeout, logs, some math operations.
Split and comment your code as much as possible during the contest. When we need help it’s better if we can explain how we code our solution. Also, evolve a well splitted code is faster than a single function with hundreds lines of codes.

Special thanks to some French CG player who helped me even until the last minutes of the challenge :

Mazelcop
Fangel
Gh0stM4chine
All Fr community in CG chat

Finally, I made a YT video in French to talk about my code. How I upgrade all my function. It’s a long 4h30 talk where I interact with some CG player Debrief + Post Mortem

NB : Without new submit I achieve to be gold

#476 Global

#377 / 467 Gold

qbit86 · November 25, 2020, 12:38am

Don’t you need two separate (independent?) bitsets for castable (not exhausted) spells and learned spells?

sgalasso · November 25, 2020, 1:07am

Hi everyone !
I finished Gold, 389 overall

There’s my postmortem:

I would be happy to have suggestions on it !
Thanks

Dahll · November 25, 2020, 2:06am

How are you getting the path of actions of your AI ? I know how to get it with a MinMax but I have no idea with beam search …
Thks !

pb4 · November 25, 2020, 7:36am

Welcome back

By the way, I finally got my revenge from Coders of The Carribean in 2017 !

Fall Challenge 2020

Coders of the Carribean 2017

Magus · November 25, 2020, 8:47am

I just use a simple vector<Action>.

Pseudo code :

class Node {
    vector<Action> actions;
    Game game;
    double eval;
}

Node parent = nextNodeToExpand();
vector<Action> actions = computePossibleActions(parent);

for (Action action : actions) {
     Node child = new Node();

     child.actions = parent.actions();
     child.actions.push(action);
     child.game = parent.game;
     child.game.play(action);
     child.eval = parent.eval() + child.game.eval() * pow(COEF_PATIENCE, depth);

     addChildToTheListOfNode(child);
}

skyyker · November 25, 2020, 9:44am

On my side, I only have one action per Node but I keep a link to the parent.

class Node{
    	Action action;
    	Node father;
    	double score;
}

Thus to retrieve the list of actions, I will loop until the node without father:

public List<Node> getNodes() {
    		List<Node> nodes = new ArrayList<>();
    		Node rootToTop = this;
    		while (rootToTop.father!=null) {
    			nodes.add(0, rootToTop);
    			rootToTop = rootToTop.father;
    		}
    		nodes.add(0, rootToTop);
    		return nodes;
    	}
    }

tutubalin · November 25, 2020, 10:08am

Wow! I independently invented totally the same approach.

Saelyos · November 25, 2020, 12:18pm

I tried something similar, but not with more success ^^. In order to find better moves, I run 2 BFS for me and the opponent to prune automatically some moves which take more times, and to have another pruning by evaluating the final nodes to eliminate the bad ones. My idea was then to explore those trees with a smistsimax (which performed really badly) or a DUCT (it was better, but still bad).

I still don’t completely understand why it didn’t work, but anyways it was not the good approach and I didn’t have the time / motivation to write something different.

SWi · November 25, 2020, 12:33pm

Final rating: 621st place in Silver league, 1177th place in global leaderboard

The algorithm I’ve decided to use was simple BFS. Using this approach, I was looking for the potion I could brew in the least amount of turns. After that I would execute the first action on the path from the current state to the state returned by the BFS. Besides that, during the first 10 turns I would learn a free spell with probability 9/10 (with probability 1/10 I would run BFS). The differences between my algorithm in wood, bronze and silver division were simply some bug fixes and taking into account new rules.
I didn’t like this challlenge, after implementing BFS I wasn’t really interested in trying to improve my algorithm.

blasterpoard · November 25, 2020, 2:21pm

I don’t entirely understand why did my rollouts work so well either; I also tried sorting the possible moves not by how much they increase the value of my inventory, but by distance from the closest gamestate where I’ve brewed all the potions (either 5, or how many I need to win) while ignoring that I need to rest, which reduced the size of graph to only 32032 nodes. Moves minimizing this distance should be relatively good (I later used this distance to prune the beamsearch when calculating statistics I mentioned in my PM), but it still performed much, much worse (only 30% winrate vs my bot) for some reason I fail to see.

qbit86 · November 25, 2020, 3:44pm

How do you implement vanilla beam search and beam stack-search? Could you please share a link to pseudocode of both? I failed to find good descriptions, free of confusing context like neural networks.

Do you need to use some priority queue (n-ary heap) for the frontier? Then you need to evaluate all intermediate nodes, not only leaves? Is evaluation function the same for leaves and inner nodes?

NicolasH_42 · November 25, 2020, 7:37pm

First CG challenge for me. Ended gold 1 hour before the end of the challenge (400th of gold).
I’ve take part in the challenge to learn Rust and i aim for Gold so i’m happy since both of my goals are completed.
I’ve learn in bonus a lot in graph search and real time (So many TOs).
My strategy for the challenge :
2 days before the challenge : made an account CG, did some puzzles in Rust and setup my dev env.
Wood : I was checking all possible actions for the turn, sort them by price and take the first one.
I’ve set security stock of 2 meaning that i dont cast spells if it lower my ingredients below my security stock so my bot had some of each tiers to brew.
I’ve manage to get bronze without a search, i was looking only for the current turn.
Bronze & Silver : First i just add a basic learn that took the production spells. (not repeatable ones). Then i needed a search.
I’ve quickly got a BFS but it was very slow because i each node was big (700 octets) and the copy was taking a lot of time (using smaller type was a big optimisation).
My search had the learns, rest, casts, and was stoping at brews. But it was the first time i had to program a graph search and my code was not optimise at all.
But since i’ve implemented the learn and the repeatable my bot was well place and i’ve got silver when the league opened.
My search was very slow and i knew i had to optimise to even consider sorting between solutions or getting path with multiple brews.
I’ve refactor during 5 days, using bitwise to store the states, heterogenous collections for the spells and brew and a unique spellbook for all the nodes (yes i had a copy of the book at every node before). But it was a pain to add the repeatable casts so i had to roll-back at a version from 5 days ago. (That was a tough call).
Silver to Gold : After the rollback i’ve add some of the optimisations i’ve come up in the 5 days.
Bitwise state, smaller nodes, one spell book.
and i was happy with the performance (Depth of 9, 30 to 50k nodes, 8k solutions found).
I start to add other features like looking what my enemy was doing etc.
But gold was hard to reach…
Then i’ve come up with a rough formula to sort my solutions and i quickly got to the top of silver.
I was about to go sleep (it was the last day at 8 in the morning) but my bot climbed out of silver and i got hype and stay awake to see him reach gold.

What help me:

I could run test offline and that was waaaaay faster that submit on CG.
Smaller type
tests

Where i’ve failed:

My node could be smaller (I’ve learn about unions after the challenge)
I didn’t use the 1s of the first turn
I didn’t profile my code (I wanted to but didn’t had the time to make it work)
i didn’t know Rust enough to see the costly operations (mallocs for example).

What i’ve like:

Love Rust, the challenge, and the tough competition.

What i didn’t like:

All into one file, ive hated this so bad, at the end i was just searching for everything. (My file was 1k lines)
The search make your bot harder to understand. (Sometime i look replays and don’t get what my bot is doing and why)
Maybe the leagues were too big.

What i’ve wish for:

An arena to make bots versus player
More chatty bots, plz put your stats into the log ><

Here is one of my matchs : (One that ive won )

Thanks to CG, and to all the competitors.

notojapko · November 25, 2020, 7:48pm

~1200 place overall

My simple approach was as follows:

at first learn some spells
- no heuristic to determine the best one
- learn the first one, not to spend money on that
- goal: learn 7 new spells
then for each move run a BFS
- no special logic here
- expand the action tree only if I can cast the spell (also check if spell is repeatable)
- goal: run till I don’t find a first potion I can brew
if I didn’t find a potion to brew in given time - cast a random spell

I tried also running the BFS until the most valuable potion was brewable, but usually I wasn’t able to do so, ending up
doing a random move.