Scientific Integrity Question
Consider a fictional AI system M, and a scientist S who wishes to use M in their research pipeline (e.g., for analysis and/or interpretation). Functionally, M is not fully understood by S. In your opinion, is it acceptable that S uses M in their research pipeline?
Comments
imo no. I just trust it
Should I know how ANOVA works before using it? Probably yes
Point being there are *degrees* of criticality of understanding the tools you use, depending on use
https://github.com/nosratullah/LCG-RandomGenerator
For most researchers in psychology or neuro, merely understanding that RNGs need new seeds is probably the extent that they need to understand them.
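A minimal sketch of why seeds matter, assuming a toy linear congruential generator (this is not the code from the linked repo; the constants are the common Numerical Recipes ones):

```python
# Toy linear congruential generator (LCG); constants are the common
# Numerical Recipes parameters, not taken from the linked repository.
def lcg(seed, n, a=1664525, c=1013904223, m=2**32):
    """Yield n pseudo-random integers starting from `seed`."""
    x = seed
    for _ in range(n):
        x = (a * x + c) % m
        yield x

# Reusing a seed reproduces the exact same "random" sequence,
# so fresh randomness requires a fresh seed.
print(list(lcg(seed=42, n=3)))  # some sequence
print(list(lcg(seed=42, n=3)))  # identical to the line above
print(list(lcg(seed=43, n=3)))  # a different sequence
```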
But I agree with your point. We can't be knowledgeable of everything. That's one of the reasons that we collaborate. To cover our blindspots!
I've found this relevant study regarding RNGs:
https://eprint.iacr.org/2024/578.pdf
you don't need to fully understand "people" to understand how responses are scored
https://bsky.app/profile/emp1.bsky.social/post/3lebu2koxac2h
https://bsky.app/profile/irisvanrooij.bsky.social/post/3leamakhegk2k
https://philsci-archive.pitt.edu/19384/
Conversely this is a considerably lower bar than complete understanding.
- stats (without getting mat mult, optimizers)
- software (w/o programming)
- screwdrivers (w/o smelting knowledge)
- RAs (can a human ever be fully understood?)
It’s neither a verifiable source nor able to do these tasks.
As a former fMRI person, I think this happens quite often.
As a person very invested in data provenance, I’d say it would be acceptable only if S involves some collaborators who understand M more fully.
Most fields use tools that are not fully understood.
For example, we don't "fully understand" what the Beck Depression Inventory is measuring, and yet it is a critical part of many research pipelines.
Any real scientist is a practical person focused on getting high-quality results.
If he or she or it can get good, high-quality results faster with AI, then he or she or it will do that.
Just like normal science: you read a paper or hear a talk and incorporate it into your work, and if the paper or talk proves wrong, that person goes on your personal sh*t list.
Happens all the time.
Either A Kornberg or S Brenner:
"Anyone can publish whatever they want and science will sort it out."
PS: in my field, it is hard to find two people of greater stature than Kornberg and Brenner.
I also answered here https://bsky.app/profile/irisvanrooij.bsky.social/post/3leboiwr5js27
https://bsky.app/profile/irisvanrooij.bsky.social/post/3lco46jmemc2y
https://link.springer.com/article/10.1007/s43681-022-00184-2
But I agree with you on the other dimensions being important, too.
Many scientists don’t really understand stats, for ex - e.g., they’ve been told they should use ANOVAs, so they do, but they don’t know why that vs. other methods.
I would say it depends. There is a class of problems that are hard to solve, but easy to check whether the solution is correct. I think those are the best use cases for AI as of now.
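A toy illustration of that asymmetry, assuming the tool hands back a claimed factorisation of a large integer: checking the claim is a single multiplication, even though producing it is hard.

```python
# Toy illustration: verifying a proposed solution is cheap even when
# finding it is hard. Suppose a black-box tool claims that n = p * q.
def check_factorisation(n, p, q):
    """Accept the claimed factors only if they really multiply back to n."""
    return p > 1 and q > 1 and p * q == n

n = 1000003 * 1000033                            # finding these factors is the hard direction
print(check_factorisation(n, 1000003, 1000033))  # True  -> safe to use downstream
print(check_factorisation(n, 1000003, 1000037))  # False -> reject, however confident the tool sounds
```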
I don't fully understand Google's search algorithm, but surely nobody would object to me using it to find papers, because my uncertainty cannot trickle down to false results
On the other hand, if I ask a black box questions that go straight into the paper,…
Everything must be verified, driven by, and passed through you; this maximises the benefit.
That said, even something as simple as matrix multiply isn't fully understood once it hits a machine. And that's a good thing.
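For example, a small numpy sketch (hypothetical matrices): the mathematically equivalent groupings (AB)C and A(BC) need not agree bit-for-bit, because floating-point addition is not associative.

```python
import numpy as np

# Mathematically, (A @ B) @ C == A @ (B @ C); in float64 the two orderings
# accumulate rounding error differently and rarely match exactly.
rng = np.random.default_rng(0)
A, B, C = (rng.standard_normal((200, 200)) for _ in range(3))

left = (A @ B) @ C
right = A @ (B @ C)

print(np.array_equal(left, right))    # typically False
print(np.max(np.abs(left - right)))   # small but non-zero rounding difference
```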
There are tools we (as a community) trust more, and tools we trust less, and those evaluations change over time.
I might be putting far more emphasis on "fully understand" than you intended. If so, feel free to disregard.
https://bsky.app/profile/glinden.bsky.social/post/3ldguapxh6c2p
(I think fake data is a violation of scientific integrity regardless of the answer)
For me, I also have problems with scientists using and interpreting p-values wrongly, and using regression without understanding what they are doing…
Wrt fake data: using it might not result in harm, but I feel it would be unreasonable to *expect that it wouldn't*.
Do all neuroscientists understand entirely how an MRI works?
IMHO, it is more a matter of epistemologically sound interpretation—understanding what can be concluded and what can't from the results (and that is/should be the job of people making those AI systems/tools).
https://journals.sagepub.com/doi/10.1177/20539517231155060
But I would claim that most scientists don't fully understand these parts of their current research pipeline:
Research assistants
Their computer
Statistical methods
Their measurement instruments
The emphasis was meant to be on *functionally* understood.
Also, the question is normative, not descriptive.
Hope this helps clarify
Can you say more about what "functional" means in this context?
I do not think “is” implies “ought”.
I also think it is worthwhile for scientists to critically reflect on our own standards. 1/
And how do we design an easily understandable knowledge base?
E.g. if M is a labeling tool, and they apply it reproducibly (eg versioned local copy), and appropriately validate its output (eg IRR with human experts + check for error biases), then I wouldn’t object.
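As a sketch of that kind of validation (hypothetical labels; `cohen_kappa_score` and `confusion_matrix` are from scikit-learn):

```python
# Validate a version-pinned labeling tool against human experts
# (hypothetical labels; requires scikit-learn).
from sklearn.metrics import cohen_kappa_score, confusion_matrix

model_labels  = ["pos", "neg", "neg", "pos", "neu", "pos", "neg", "neu"]
expert_labels = ["pos", "neg", "pos", "pos", "neu", "pos", "neg", "neg"]

# Inter-rater reliability between the tool and the human experts.
print("Cohen's kappa:", round(cohen_kappa_score(model_labels, expert_labels), 2))

# Check whether the tool's errors pile up on particular classes.
print(confusion_matrix(expert_labels, model_labels, labels=["pos", "neu", "neg"]))
```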
I use AI for translation. Not for text analysis, but for reading reports and papers.
But I use plenty of other computer programmes too that I don't understand either.
Also, how much do they functionally understand it?
If they have a trustworthy model card / specification with well-defined confusion matrix, well-understood output distributions… that kind of thing—it’s probably OK
I think there are other reasons M may be unsuitable for their task, but this isn't a bar we hold other research tools to.
An AI has no ethical locus -- and (usually) has been built with stolen IP. Exactly contrary to a central goal of scientific authorship.
IMO the question is who takes responsibility for how and why they are making claims.
Realistically, usage of LLMs seems overwhelmingly likely to fail both parts of that test.
"Yes, it is very true, that. And it is just what some people will not do. They conceive a certain theory, and everything has to fit into that theory. If one little fact will ...
Another example is the use of the telescope for astronomical observations by Galileo. Is the Moon imperfect or the telescope?
As for “Against Method”, I’ll think more about this. I think there is a difference between what Feyerabend argued for versus what is sold to science in AI hype … but maybe not.
But will think more.
In the example of linear regression, seems to me that linear regression is functionally well understood (i.e., its input-output mapping is well-defined). Causality is a different beast. It is a concept.
I didn't know that there was the possibility of negative weights in linear regression. So, not sure we can reduce it to a problem with the concept of causality.
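A small numpy sketch (simulated data) of how that happens: a predictor that is essentially uncorrelated with the outcome can still receive a clearly negative weight once the other predictor is in the model, which is one reason regression weights resist a naive causal reading.

```python
import numpy as np

# Suppressor effect: x2 is uncorrelated with y on its own, yet its
# regression weight comes out clearly negative (simulated data).
rng = np.random.default_rng(1)
n = 10_000
t = rng.standard_normal(n)           # the signal y actually depends on
u = rng.standard_normal(n)           # measurement noise in x1
y = t + 0.5 * rng.standard_normal(n)
x1 = t + u                           # noisy measurement of the signal
x2 = u                               # the noise itself, unrelated to y

X = np.column_stack([np.ones(n), x1, x2])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.round(coef, 2))                          # roughly [0.0, 1.0, -1.0]
print(round(float(np.corrcoef(x2, y)[0, 1]), 2))  # ~0.0
```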
The real reason I trust these tools is their consistency with other methods. I would say that AI results come into that category.
In doing testing of electrical equipment, being able to verify the results requires being able to verify all the equipment used in the testing.
I'd apply the same logic to software, so in order for S to verify the effect of system M in their research, they have to know how it works.
but some are ....
profitable
E.g., engineers may be given tools to scrape, store, and analyze the Internet at scale, but no money for licenses, nor any mention, accountability, or even a path to discuss such.
The outcome is to use what they are given, i.e., tools for theft.
Better thinking from the start can help prevent this waste imho
Fwiw, the OP was meant to be about present-day AI systems; I doubt they fit your scenario.