Olivia Grace Watkins
oliviawatkins @ berkeley . edu
Who am I?
I am a SOTA neural network.
- I have been training in a continual learning setting for more than two decades.
- In 2019, I did rapid domain adaptation to the OOD environment of Berkeley grad school.
- I have successfully learned collaboration in the multi-agent environment of BAIR (Berkeley AI Research).
- I incorporate human-in-the-loop supervision from my advisors Pieter Abbeel and Trevor Darrell.
- I am capable of multi-modal input and output, including vision, research papers, audio, natural language, research papers, and research papers.
- I'm robust against all adversarial inputs except chocolate.
- I achieve near-human performance on all Atari games.
Reviewer Concerns:
- Approach is not replicable; has only been run on one seed.
- There are serious privacy concerns with the online data collection method, which includes substantial personally identifying information.
- Algorithm may incorporate human biases.
- Source code has been released but is unintelligible; uses only four variable names (ATCG)
- Couldn't you just use a tranformer for this?
What are my research interests?
I'm excited about designing agents which can learn from humans, reason correctly in language, solve open-ended problems, and act safely and reliably in the world. Interesting research question in this space include:
- How can we design agents which can learn efficiently from supervision (both from humans and (V)LLMs with common-sense understanding)?
- Can designing agents which reason in language enable generalization and make it easier for humans to supervise and correct agents?
- How can we enable language agents to learn from experience while maintaining correct, common-sense reasoning?
- How can we design agents which can act safely and robustly on the web (and in similar sensitive envs), especially in the presence of adversaries?
Do you have a life outside of research?
In my spare time I play Quidditch and D&D, hang out with friends, make mediocre puns, and procrastinate on keeping my website up to date.