davidbau

davidbau

Postdoc researcher in deep nets and vision, ex-Googler. Believes that deep learning should be transparent.

Member Since 8 years ago

MIT @CSAIL,

Experience Points
670
follower
Lessons Completed
0
follow
Lessons Completed
17
stars
Best Reply Awards
70
repos

480 contributions in the last year

Pinned
⚡ Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.
⚡ Pytorch-based tools for visualizing and understanding the neurons of a GAN. https://gandissect.csail.mit.edu/
⚡ Seeing what a GAN cannot generate. Visualizes and quantifies object classes within scenes that are outside the range of a GAN.
⚡ Quick, visual, principled introduction to pytorch code through five colab notebooks.
⚡ Code for the Proceedings of the National Academy of Sciences 2020 article, "Understanding the Role of Individual Units in a Deep Neural Network"
⚡ Chart of current COVID-19 time series data. Enables a variety of county- state- and nation-level comparisons and data exploration.
Activity
Dec
3
3 days ago
Dec
2
4 days ago
push

davidbau push davidbau/webhome

davidbau
davidbau

Fix makefile to handle people subdir.

commit sha: edb9ac569d63aad2e0e15b9695b991fd5edd7141

push time in 4 days ago
Nov
30
6 days ago
push

davidbau push davidbau/webhome

davidbau
davidbau

Add information for prospective students.

commit sha: bdee1c2684d6c701df73e69b2db05b19d61b62d7

push time in 6 days ago
push

davidbau push davidbau/webhome

davidbau
davidbau

Add some recent collaborators.

commit sha: c04ed07f427c97b4c7a2568e4340ab1a1f3e8b91

push time in 6 days ago
Nov
28
1 week ago
Activity icon
created branch

davidbau in davidbau/mithome create branch main

createdAt 1 week ago
Nov
20
2 weeks ago
Activity icon
created branch
createdAt 2 weeks ago
Activity icon
created repository
createdAt 2 weeks ago
Oct
29
1 month ago
Activity icon
issue

davidbau issue davidbau/seedrandom

davidbau
davidbau

This package is using eval(); any more modern way to avoid this?

This package is used in one of my deps which I need in my cloudflare worker but they don't allow eval() is there any more modern way to rewrite this without eval?

Uncaught EvalError: Code generation from strings disallowed for this context
  at line 2
  at line 2 in ./node_modules/seedrandom/seedrandom.js
  at line 2 in s
  at line 2 in ./node_modules/seedrandom/index.js
  at line 2 in s
  at line 2 in ./node_modules/@tensorflow/tfjs-data/dist/dataset.js
  at line 2 in s
  at line 2 in ./node_modules/@tensorflow/tfjs-data/dist/index.js
  at line 2 in s
  at line 2 in ./node_modules/@tensorflow/tfjs/dist/index.js
Activity icon
issue

davidbau issue comment davidbau/seedrandom

davidbau
davidbau

This package is using eval(); any more modern way to avoid this?

This package is used in one of my deps which I need in my cloudflare worker but they don't allow eval() is there any more modern way to rewrite this without eval?

Uncaught EvalError: Code generation from strings disallowed for this context
  at line 2
  at line 2 in ./node_modules/seedrandom/seedrandom.js
  at line 2 in s
  at line 2 in ./node_modules/seedrandom/index.js
  at line 2 in s
  at line 2 in ./node_modules/@tensorflow/tfjs-data/dist/dataset.js
  at line 2 in s
  at line 2 in ./node_modules/@tensorflow/tfjs-data/dist/index.js
  at line 2 in s
  at line 2 in ./node_modules/@tensorflow/tfjs/dist/index.js
davidbau
davidbau

Which version are you seeing the error with? Version 3.0.5 removed eval, so likely an upgrade will fix it for you.

Oct
10
1 month ago
Activity icon
issue

davidbau issue huggingface/datasets

davidbau
davidbau

the_pile_openwebtext2 produces ArrowInvalid, value too large to fit in C integer type

Describe the bug

When loading the_pile_openwebtext2, we get the error pyarrow.lib.ArrowInvalid: Value 2111 too large to fit in C integer type

Steps to reproduce the bug

import datasets
ds = datasets.load_dataset('the_pile_openwebtext2')

Expected results

Should download the dataset, convert it to an arrow file, and return a working Dataset object.

Actual results

The download works, but conversion to the arrow file fails as follows:

>>> ds = datasets.load_dataset('the_pile_openwebtext2')
Downloading and preparing dataset openwebtext2/plain_text (download: 27.33 GiB, generated: 63.86 GiB
, post-processed: Unknown size, total: 91.19 GiB) to /home/davidbau/.cache/huggingface/datasets/open
webtext2/plain_text/1.0.0/c48ec73ba3483bac673463f48f67e9a4fd8cb49a9d6ec4fb957f0b424b97cf25...
Traceback (most recent call last):
  File "/home/davidbau/.conda/envs/tenv/lib/python3.9/site-packages/datasets/builder.py", line 1133,
 in _prepare_split
    writer.write(example, key)
  File "/home/davidbau/.conda/envs/tenv/lib/python3.9/site-packages/datasets/arrow_writer.py", line
366, in write
    self.write_examples_on_file()
  File "/home/davidbau/.conda/envs/tenv/lib/python3.9/site-packages/datasets/arrow_writer.py", line
311, in write_examples_on_file
    pa_array = pa.array(typed_sequence)
  File "pyarrow/array.pxi", line 222, in pyarrow.lib.array
  File "pyarrow/array.pxi", line 110, in pyarrow.lib._handle_arrow_array_protocol
  File "/home/davidbau/.conda/envs/tenv/lib/python3.9/site-packages/datasets/arrow_writer.py", line
115, in __arrow_array__
    out = pa.array(cast_to_python_objects(self.data, only_1d_for_numpy=True), type=type)
  File "pyarrow/array.pxi", line 305, in pyarrow.lib.array
  File "pyarrow/array.pxi", line 39, in pyarrow.lib._sequence_to_array
  File "pyarrow/error.pxi", line 122, in pyarrow.lib.pyarrow_internal_check_status
  File "pyarrow/error.pxi", line 84, in pyarrow.lib.check_status
pyarrow.lib.ArrowInvalid: Value 2111 too large to fit in C integer type
## Environment info
<!-- You can run the command `datasets-cli env` and copy-and-paste its output below. -->
- `datasets` version:
  • Platform: Ubuntu 20.04
  • Python version: python 3.9
  • PyArrow version: 3.0.0
Sep
26
2 months ago
Sep
10
2 months ago
push

davidbau push davidbau/envs

davidbau
davidbau

Add custom compilation of pytorch 1.5.

commit sha: aaf65496add0c2125c6be7836cfd2047e46c6658

push time in 2 months ago
Sep
9
2 months ago
Previous