Introduce linter for deeply nested code? #1848

IndrajeetPatil · 2022-12-15T19:18:37Z

Preamble

There is a substantial body of literature showing that deeply nested code increases complexity and is difficult to understand. This code can almost always be redesigned to be simpler (for a recent demo, see this video). It would be good if we can provide a linter that lints functions that nest deeper than the specified threshold.

I realize that this sounds similar to cycloclomp_linter(), but I'd argue that the proposed linter will be slightly different. E.g., a function, especially a small one, can have a McCabe’s complexity lower than the specified threshold (say 15) and yet have nesting deeper than the specified threshold.

Default

The book below reports that the ability of programmers to comprehend a loop deteriorates significantly beyond three levels of nesting, so maybe that can be the default.

Yourdon, Edward (1986). Managing the Structured Techniques: Strategies for Software Development in the 1990s, 3d ed. New York, NY: Yourdon Press.

That said, I am not sure how outdated this literature is or if there are newer studies on this topic that suggest a different default.

Example

Here is an example I came across in a closed-source project. I have removed all the code to remove any way to identify the project and also to lay bare the nested structure of the code. It'd be nice if linter could detect code like this and flag it for redesign/refactor.

foo <- function() {
  # ...
  
  for (grade in grades) {
    # ...
    
    for (subject in subjects) {
      # ...
      
      if (nrow(subject_data) > 0) {
        # ...
        
        for (file in files) {
          # ...
          
          if (file.exists(local_doc)) {
            # ...
          } else {
            # ...
          }
          
          # ...
          
          if (file.exists(local_docx)) {
            # ...
            
            if (isTRUE(delete_doc)) { 
              # ...
            }
            
            # ...
          }
        }
      }
    }
  }
}

The text was updated successfully, but these errors were encountered:

MichaelChirico · 2022-12-15T19:39:11Z

we have an unnecessary_nesting_linter() that will get at what you're after I believe

…

On Thu, Dec 15, 2022, 11:18 AM Indrajeet Patil ***@***.***> wrote: Preamble There is a substantial body of literature showing that deeply nested code increases complexity and is difficult to understand. This code can almost always be redesigned to be simpler (for a recent demo, see this <https://www.youtube.com/watch?v=CFRhGnuXG-4> video). It would be good if we can provide a linter that lints functions that nest deeper than the specified threshold. I realize that this sounds similar to cycloclomp_linter(), but I'd argue that the proposed linter will be slightly different. E.g., a function, especially a small one, can have a McCabe’s complexity lower than the specified threshold (say 15) and yet have nesting deeper than the specified threshold. Default The book below reports that the ability of programmers to comprehend a loop deteriorates significantly beyond *three* levels of nesting, so maybe that can be the default. Yourdon, Edward (1986). *Managing the Structured Techniques: Strategies for Software Development in the 1990s*, 3d ed. New York, NY: Yourdon Press. That said, I am not sure how outdated this literature is or if there are newer studies on this topic that suggest a different default. Example Here is an example I came across in a closed-source project. I have removed all the code to remove any way to identify the project and also to lay bare the nested structure of the code. It'd be nice if linter could detect code like this and flag it for redesign/refactor. foo <- function() { # ... for (grade in grades) { # ... for (subject in subjects) { # ... if (nrow(subject_data) > 0) { # ... for (file in files) { # ... if (file.exists(local_doc)) { # ... } else { # ... } # ... if (file.exists(local_docx)) { # ... if (isTRUE(delete_doc)) { # ... } # ... } } } } } } — Reply to this email directly, view it on GitHub <#1848>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AB2BA5I4UZB7YZZAFBHHUKTWNNVJRANCNFSM6AAAAAATACWTGQ> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

MichaelChirico · 2023-10-03T17:09:08Z

In this particular case, cyclocomp_linter() will suffice.

AshesITR · 2023-10-04T06:03:59Z

Re nesting we should be careful with R6Classes.
That is one case where cyclocomp also quickly fails to provide helpful lints simply because the complexity of all class methods seems to be summed.

Here is a trivial class which reaches 4 levels of nesting:

MyClass <- R6::R6Class(
  "MyClass",
  public = list(
    hello = function(x) {
      if (missing(x)) {
        x <- "unknown"
      }
      paste0("Hello, ", x, "!")
    }
  )
)

MichaelChirico · 2023-11-18T05:45:41Z

See #2302 for some initial checks on overly-nested code.

IndrajeetPatil · 2024-10-28T07:58:09Z

I am not sure why I marked this as closed because we still don't have a way to lint deeply nested code.

Here is a good example to showcase how code can be deeply nested and still pass the McCabe complexity threshold:

library(lintr)

lint(linters = cyclocomp_linter(5L), "
if (isTRUE(a))
  if (isTRUE(b))
    if (isFALSE(c))
      if (isFALSE(d))
        i <- 1L
")
#> ℹ No lints found.

^{Created on 2024-10-28 with reprex v2.1.1}

This code should produce lint with this new linter.

It can be refactored to avoid nesting, which would make it more readable:

# option-1
if (isTRUE(a) && isTRUE(b) && isFALSE(c) && isFALSE(d)) {
  i <- 1L
}

# option-2
should_set_i <- function(a, b, c, d) {
  isTRUE(a) && isTRUE(b) && isFALSE(c) && isFALSE(d)
}

if (should_set_i(a, b, c, d)) {
  i <- 1L
}

# option-3
if (all(isTRUE(a), isTRUE(b), isFALSE(c), isFALSE(d))) i <- 1L

# option-4
if (!isTRUE(a)) return(NULL)
if (!isTRUE(b)) return(NULL)
if (!isFALSE(c)) return(NULL)
if (!isFALSE(d)) return(NULL)

i <- 1L

# etc.

IndrajeetPatil added the google-linters label Dec 15, 2022

IndrajeetPatil mentioned this issue Dec 21, 2022

Lint repeated if/else if/ else after a certain threshold? #1868

Closed

IndrajeetPatil closed this as completed Dec 7, 2023

IndrajeetPatil reopened this Oct 28, 2024

IndrajeetPatil added the new-linter label Oct 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce linter for deeply nested code? #1848

Introduce linter for deeply nested code? #1848

IndrajeetPatil commented Dec 15, 2022

MichaelChirico commented Dec 15, 2022 via email

MichaelChirico commented Oct 3, 2023

AshesITR commented Oct 4, 2023

MichaelChirico commented Nov 18, 2023

IndrajeetPatil commented Oct 28, 2024

Introduce linter for deeply nested code? #1848

Introduce linter for deeply nested code? #1848

Comments

IndrajeetPatil commented Dec 15, 2022

Preamble

Default

Example

MichaelChirico commented Dec 15, 2022 via email

MichaelChirico commented Oct 3, 2023

AshesITR commented Oct 4, 2023

MichaelChirico commented Nov 18, 2023

IndrajeetPatil commented Oct 28, 2024