[Feature]: RTN based INT8 W8A8 scheme support

### Feature Description

INT8 W8A8 scheme support was requested.  We plan to start from adding RTN based quantization scheme first. Smoothing algo is likely needed for accuracy, will track another issue seperately  

considering deployment,  compressed-tensor format should be supported 
### Motivation and Use Case

Target for current Xeon CPU, like GNR.   

### Alternatives Considered

_No response_

### Definition of Done

_No response_

### Additional Context

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: RTN based INT8 W8A8 scheme support #1468

Feature Description

Motivation and Use Case

Alternatives Considered

Definition of Done

Additional Context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature]: RTN based INT8 W8A8 scheme support #1468

Description

Feature Description

Motivation and Use Case

Alternatives Considered

Definition of Done

Additional Context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions