
[D] Lawsuit alleges fabricated results at Pinscreen led by Hao Li

The filing can be found here.
These are very serious allegations: generated model results were blatantly fabricated for academic papers as well as public demonstrations. In addition, there are some pretty awful allegations of worker abuse, including an attack on the plaintiff when they attempted to confront Li about the academic misconduct.
level 1
page 21, top screenshot:
anyways ... it's important that we know exactly who is using the webcam to generate the avatar
since we are just using pre-cached avatars
it's called SIGGRAPH "Real Time Live" not "Pre-Cached Live" ... this is bad
level 2
11 points · 8 months ago
I’m not familiar with their process, but are we talking blatant fabrication here, or just “regress to a pre-trained latent vector distribution” type stuff?
level 3
When I read it, it looked like the demo was supposed to do the rendering on the spot, from scratch. Sadeghi himself tried it and observed that it took a long time and had issues (a screenshot of his message is included). That's why they faked it.
I didn't spend time finding the conference page (or the videos?) for it; that would be helpful.
level 4
5 points · 8 months ago · edited 8 months ago
I believe their presentation at SIGGRAPH 2017 RTL is here
level 3
Blatant data fabrication that is so shocking it is hard to believe. See, for example:
151. On May 22, 2017, one day before the submission deadline, Li ordered the team, on “PinscreenTeamAll” Skype thread, including Saito, Nagano, Wei, Yen-Chun Chen, Hu, Fursund, Sun, Kung, Seo, Yu, Xiang, Stephen Chen, Zhou, and Sadeghi to fabricate the Hair Polystrip Patch Optimization process stating “we spent 1 day on it,” that is a lot, and that “if in an hour it’s not working, let’s do it manually and give up on it. I don’t think we can make it automatic.” (Exhibit E8)
level 2
The amended complaint shows that this sentence was said by Jens Fursund, the CTO of Pinscreen:

196. [July 24, 2017] Fursund: “Anyway… It’s important that we know exactly who is using the webcam to generate the avatar”
197. [July 24, 2017] Fursund: “Since we’re just using pre-cached avatars”
level 1
Anyone want to wager on whether this leads to other whistleblowers coming forward about other ML & AI startups that are still faking it?
Is this just an isolated personality, or a symptom of a competitive market?
level 2
Probably the tip of the iceberg. Lots of "AI" companies are just mechanical Turks under the guise that they're gathering training data.
level 3
I actually don't think there's anything wrong with it as long as (1) they're honest that humans are doing some of the stuff behind the scenes, (2) they're not misleading scientific conferences.
level 4
There is a saying in SV: fake it till you make it.
The problem is that if they acknowledged it's not going to work, they wouldn't get more investment. At some point you're incentivized to keep the con going.
level 5
The Silicon Valley TV show addressed this: https://www.youtube.com/watch?v=Txl90NEl92U
level 5
But I think there needs to be a line between creating the perception that you're further along than you are and outright fraud. For example, if they had presented demos and admitted (perhaps in a way that didn't draw attention to it) that certain pieces were touched up by a human, they might still have been able to build enthusiasm for their product without actually committing fraud.
This should also be a wake-up call for the computer vision community to get better at preventing fraudulent demos.
level 3
Well, what is worse about this case is that Li, Pinscreen's CEO and the one allegedly leading the data fabrication, is an assistant professor with many SIGGRAPH publications, which means they are all contaminated with fabrication.
level 2
I guess it's important to be more skeptical of academic claims, especially when there's a startup involved, even if they come from someone at a respected institution.
level 1
Oh my god :( Sad, sad behavior. Plus, I always suspected some people deliberately make academic literature hard to follow. Here is an example, from page 83 of the document, by Li: "We need to make sure that people cannot easily implement it"; "maybe we add a lot of things about the hair cutting etc." So much for reproducibility :(
level 2
They really don't, though, in general. One man's clear explanation is another man's "WTF is this shit?" Explaining things clearly is a skill, and not a skill that is widely distributed.
Also, people write for their peers, since that is going to be most of the audience for their paper.
level 3
This is true to some degree; technical writing is a skill that is usually not picked up naturally, and it's taught in a class that's ignored by most CS students (if taught at all), IME. It is, in my opinion, very hard. But as the person you're responding to quoted, what you are describing is not the case here:
here is an example: Page 83 of the document, by Li "We need to make sure that people cannot easily implement it" "maybe we add a lot of things about the hair cutting etc."
level 4
Li is allegedly a crook, but the comment was speculating that this is the reason technical papers are hard to follow in general.
level 1
Now that reads extremely badly. If anything, this kind of "fake it till you hopefully make it" is a really big issue for research in such a competitive corporate setting.
level 1
I was going to highlight stuff like
[April 18, 2017] Li: “We need to make sure that people cannot easily implement it”
and
[June 21, 2017] Li: “What I mean is that it’s not easy to tell how to tweak data to get the results we want”
but then I found this, which is even worse IMHO
[February 4, 2017] Li: “One of our tasks is to map segmented hair images to 3D hairstyles
[February 4, 2017] Li: “Here is a paper that is kinda related”
[February 4, 2017] Li: “But not exactly what we want”
[February 4, 2017] Li: “Don’t share it”
[February 4, 2017] […]
[February 4, 2017] Li: [c118-f118_2-a523-paper-v1.pdf]
[March 3, 2017] Li: “Don’t share this paper”
[March 3, 2017] Li: “It’s under review”
This is ... bad ... so weird because this guy seemed like a respectable academic.
level 2
8 points · 8 months ago · edited 8 months ago
Isn't it normal to ask that a prepublication draft not be leaked until it has been accepted?
EDIT: Ooops. I misunderstood the messages exchanged. Thanks to /u/netw0rkf10w for setting me straight.
level 3
You don’t get it.
"Li’s academic misconduct included sharing confidential under-review scientific paper submissions from competitor research groups within Pinscreen and suggesting to look for 'details that can be used.'"
He was either a reviewer or an area chair, and he was sharing a paper under submission from another group/competitor with his team (and asked them not to share it further).
level 4
You are absolutely correct. I misread the message exchange and thought it was his own paper.
I should have read through to page 18.
level 3
I guess it depends whether it was his or not?