Currently tokens test are flaky. It appears `cleanup` function can raise an Error. Locally, we use `runInBand` option to run our tests and all is working fine. The mess up could appear when test are running in parallel as cleanup can remove file used in other test at any moment.