💠long term memory project failed due to high costs. Could most calls be done with a cheap model and thinking be done with a more expensive model when needed? Different tiers? How to know when a given tier is needed.
💠long term memory project failed due to high costs. Could most calls be done with a cheap model and thinking be done with a more expensive model when needed? Different tiers? How to know when a given tier is needed.