Add guide for R by david-cortes-intel · Pull Request #38 · intel/optimization-zone

david-cortes-intel · 2026-06-24T11:12:59Z

Adds a guide for optimizing R workflows.

adgubrud

LGTM

rsiyer-intel · 2026-07-01T22:01:24Z

+
+If requests mostly involve compute-heavy operations (e.g. matrix multiplications, as opposed to fetching data from online databases), it is recommended to limit the number of parallel requests to number of threads or to number of physical cores in the machine, as otherwise requests will compete for resources and this will cause slowdowns and decreased throughput. Likewise, If using [Kubernetes](https://kubernetes.io) (also known as 'k8s'), avoid allocating less than a full CPU core to a compute-heavy pod, and avoid fractional core allocations.
+
+## Data frame operations


For this section, is it possible to provide a summary/compact recommendation table/decision table (example - data size, operation type, memory constraints) that will help answer "which one should I pick?"

Added a table based on operation type. For data size, it'd be quite hard to make recommendations like that, because it depends a lot more on what operations are done with that data.

rsiyer-intel · 2026-07-01T22:11:46Z

+
+Those sparse objects will be accepted as input by many modeling-related packages, such as `glmnet`, `xgboost`, `ranger`, `rsparse` and others, which have routines to operate efficiently on them.
+
+As a general rule, sparse representations only start being advantageous when the number of non-zeros in the data is less than 10%, but the exact threshold at which switching is optimal can vary a lot by use-case. If the amount of non-zeros is less than 1% however, it is very unlikely that a regular dense data representation would be more efficient when a sparse format is supported.


Please fix following typos -

modifyin → modifying — line 3
environmnet → environment — line 86
sytem (in “sytem level”) → system — line 118
onMKL → oneMKL — line 122
apriori → a priori — line 159
PlumbeR → plumber — line 268
constitude → constitute — line 270

Fixed, but:

There's no typo in line 3.

PlumbeR is how the authors named the library being referenced.

rsiyer-intel

Thanks for the changes!

david-cortes-intel added 4 commits June 24, 2026 13:10

add guide for R

1a9e247

more details

33b0e2d

clearer example

8c82678

more details

ab26794

rsiyer-intel requested review from adgubrud and Copilot and removed request for Copilot June 26, 2026 00:04

Copilot started reviewing on behalf of rsiyer-intel June 26, 2026 00:05 View session

david-cortes-intel added 2 commits June 26, 2026 07:42

missing argument

b6090ab

more tips

8c21f01

adgubrud previously approved these changes Jun 30, 2026

View reviewed changes

rsiyer-intel reviewed Jul 1, 2026

View reviewed changes

Comment thread software/R/README.md

rsiyer-intel reviewed Jul 1, 2026

View reviewed changes

rsiyer-intel requested changes Jul 1, 2026

View reviewed changes

fix typos

1f2d7f3

david-cortes-intel dismissed adgubrud’s stale review via 1f2d7f3 July 2, 2026 08:48

david-cortes-intel added 2 commits July 2, 2026 10:59

add table version for suggested libraries

60d5804

typo

a9c743a

rsiyer-intel approved these changes Jul 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add guide for R#38

Add guide for R#38
david-cortes-intel wants to merge 9 commits into
intel:mainfrom
david-cortes-intel:rlang

david-cortes-intel commented Jun 24, 2026

Uh oh!

adgubrud left a comment

Uh oh!

rsiyer-intel Jul 1, 2026

Uh oh!

david-cortes-intel Jul 2, 2026

Uh oh!

Uh oh!

rsiyer-intel Jul 1, 2026

Uh oh!

david-cortes-intel Jul 2, 2026

Uh oh!

rsiyer-intel left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		If requests mostly involve compute-heavy operations (e.g. matrix multiplications, as opposed to fetching data from online databases), it is recommended to limit the number of parallel requests to number of threads or to number of physical cores in the machine, as otherwise requests will compete for resources and this will cause slowdowns and decreased throughput. Likewise, If using [Kubernetes](https://kubernetes.io) (also known as 'k8s'), avoid allocating less than a full CPU core to a compute-heavy pod, and avoid fractional core allocations.

		## Data frame operations


		Those sparse objects will be accepted as input by many modeling-related packages, such as `glmnet`, `xgboost`, `ranger`, `rsparse` and others, which have routines to operate efficiently on them.

		As a general rule, sparse representations only start being advantageous when the number of non-zeros in the data is less than 10%, but the exact threshold at which switching is optimal can vary a lot by use-case. If the amount of non-zeros is less than 1% however, it is very unlikely that a regular dense data representation would be more efficient when a sparse format is supported.

Uh oh!

Conversation

david-cortes-intel commented Jun 24, 2026

Uh oh!

adgubrud left a comment

Choose a reason for hiding this comment

Uh oh!

rsiyer-intel Jul 1, 2026

Choose a reason for hiding this comment

Uh oh!

david-cortes-intel Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rsiyer-intel Jul 1, 2026

Choose a reason for hiding this comment

Uh oh!

david-cortes-intel Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

rsiyer-intel left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants